


THE DESIGN AND ANALYSIS OF
FACTORIAL EXPERIMENTS


F. YATES, M.A.
(Rothamsted Experimental Station)


Technical Communication No. 35 of the
COMMONWEALTH BUREAU OF SOILS
Harpenden, England


COMMONWEALTH AGRICULTURAL BUREAUX
FARNHAM ROYAL, BUCKS., ENGLAND



CONTENTS 


1. Introduction

(a) Principles underlying factorial design.

(b) Criticisms of factorial design.

(c) Scope of the present paper.

(d) New material. 

(e) Notation, etc. 

2. A simple factorial experiment on potatoes 

(a) Yields of the different combinations of treatments. 

(b) Main effects. 

(c) Interactions. 

(d) Calculation of the main effects and interactions from the 

experimental yields. 

(e) Interpretation of main effects and interactions. 

(f) General remarks. 

3. Statistical analysis of a 2 x 2 x 2 experiment 

4. Confounding 

(a) Example to illustrate confounding. 

(b) Statistical analysis. 

(c) Presentation of results. 

(d) Example of partial confounding. 

(e) Statistical analysis.

(f) Presentation of results.

5. Systems of confounding for 2 x 2 x 2 x . . . designs 

(a) Confounding with five factors. 

(b) Confounding with six factors. 

(c) Confounding with four factors in blocks of 4 plots. 

(d) General remarks. 

6. Estimation of error from high-order interactions 

7. An exploratory experiment on beans 

(a) Analysis. 

(b) Gain in precision due to confounding.

8. Confounding in Latin square designs with factors at two levels

(a) 2 x 2 x 2 design in two 4 x 4 squares.

(b) Numerical example.

(c) Arrangements for five and six factors in an 8 x 8 square.

9. Factors at more than two levels

(a) Two factors.

(b) Three or more factors.

(c) Simplification when one of the factors is at two levels only.

(d) Procedure when two or more factors are at two levels.

(e) Two factors at three levels : formal sub-division of interactions

in a 3 x 3 table.

(f) Example.

10. Confounding with three and four factors each at three levels

(a) 3 x 3 x 3 design in blocks of 9 plots.

(b) Example of a 3 x 3 x 3 design.

(c) Adjusted yields of three-factor combinations.

(d) 3 x 3 x 3 x 3 designs in blocks of 9 plots.

(e) 3³ and 3⁴ designs in quasi-Latin squares.

(f) Extension to 3ⁿ designs in blocks of 3ⁿ⁻¹ or 3ⁿ⁻² plots.






11. The subdivision of sets of degrees of freedom

(a) Subdivision of main effects.

(b) Subdivision of interactions.

(c) Example.

(d) General remarks.

12. The 3 x 3 x 3 design ; single replication

(a) Systematic method of analysis.

(b) Alternative method.

(c) The linear component of the three-factor interaction.

13. Confounding with some factors at two and some at three levels

(a) 3 x 2 x 2 design in blocks of 6 plots.

(b) Statistical analysis of 3 x 2 x 2 design.

(c) Example.

(d) 3 x 2 x 2 x 2 design in blocks of 6 plots.

(e) Extension to 3 x 2ⁿ design in blocks of 3 x 2ⁿ⁻¹ and 3 x 2ⁿ⁻² plots.

(f) 3 x 3 x 2 design in blocks of 6 plots.

(g) 3 x 3 x 3 x 2 design in blocks of 6 plots.

(h) Extension to 3ⁿ x 2 designs in blocks of 3ⁿ⁻¹ x 2 and 3ⁿ⁻² x 2 plots.

(i) 3 x 3 x 2 design in a 6 x 6 quasi-Latin square.

14. Confounding with one or more factors at four levels or eight levels

(a) General method. 

(b) Example : 4 x 4 designs. 

(c) Combined varietal and manurial trials in Latin squares. 


15. Dummy treatments

(a) Application of fertilizer at two different times. 

(b) Alternative designs. 

(c) 3 x 3 x 3 designs including quality differences.

16. Arrangements with split plots

(a) Structure and analysis of split-plot designs.

(b) Example : a varietal and manurial trial on oats.

(c) Calculation of standard errors.

(d) Efficiency.

(e) Confounding of interactions in split-plot designs.

(f) Half-plaid Latin squares.

(g) Plaid squares.

(h) Latin squares with split plots in varietal trials.

(i) The Graeco-Latin square.

(j) The hyper-Graeco-Latin square.

17. Variety trials : quasi-factorial designs

(a) The lattice.

(b) Triple and balanced lattices.

(c) Lattice squares.

(d) Three-dimensional lattices.

(e) [...] designs : balanced incomplete blocks.

(f) The introduction of additional treatments in quasi-factorial designs.




Notes

1. Number of figures required in the computations and results.

2. Numerical divisors in the analysis of variance, etc.

3. Orthogonal functions.

4. Hints on the use of calculating machines.

References and material for further reading


The Design and Analysis of 
Factorial Experiments 


1. Introduction.

Factorial experiments are experiments which include all combinations 
of several different sets of treatments or “ factors.” Information is thus 
simultaneously obtained on the responses to the different factors, and also on 
the effects of changes in the level of each factor on the responses to the others. 

This Technical Communication has not been written with the object of 
convincing experimenters of the need for employing factorial designs, but 
rather for those who, while fully conscious of the advantages of such designs, 
find difficulty in laying them out and in analysing the results. It is, in fact, 
an attempt to give a comprehensive survey of the simpler types of design at 
present available, and a description of the appropriate methods of analysis. 
The reader who has not done so is advised first to read Prof. R. A. Fisher’s
Design of Experiments, where he will find a full account of the logical basis of the
whole technique of modern experimental design.


1a. Principles underlying factorial design.

The points at issue may be made clear by the consideration of an example.
Suppose it is desired to introduce a new crop into a country, and that nothing
is known of the most suitable varieties, type and quantity of manuring, the
best cultivations, etc. The classical procedure would be to set up separate
experiments to determine the best varieties, others to investigate the manurial
requirements, others (if indeed any were undertaken on this point) to determine
the most suitable methods of cultivation, rates of sowing, etc. Unfortunately,
however, we cannot conduct manurial experiments without choosing some
variety on which to conduct them, nor can we conduct varietal trials without
deciding on some level of manuring, a rate of sowing, width between rows,
cultivations, etc. Now it may happen that the effects of fertilizers on
different varieties are materially different, or that varieties that are good yielders
at wide spacings, owing to a rank habit of growth, are much inferior in yield
(or other qualities) when sown at close spacings. Thus conclusions that have
been laboriously reached on the correct level of manuring for one variety may
be inapplicable to the variety finally chosen, and that variety may itself be
incorrectly chosen through not realizing the possibilities of increasing the yield
of other varieties by changes in cultural practices.

Of course none of these misfortunes may occur. Varietal differences
may be substantially the same for all levels of manuring and all cultural practices,
and responses to fertilizers may be unchanged by change in practice.
Indeed such experimental programmes would be completely futile if this were
not usually the case. But even where it does happen that no interactions of
this kind exist, such methods are exceedingly inefficient in comparison with factorial
experiments, for the reason that in factorial experiments all the plots are used
many times over in making estimates of the effects of the different factors.
Thus for example, with four factors, each at two levels, there are 16 treatment
combinations. With 80 plots five replications of each combination are therefore
possible. The estimate of the effect of any one factor, if this effect is
unchanged for variations of the other factors, is obtained from the comparison
of the mean of the 40 plots receiving the higher level of this factor with the
mean of the other 40. If four separate experiments are undertaken, one on
each factor, then each experiment will contain 20 plots only, and the estimate
of the effect of each factor will be obtained from the comparison of two means
of 10 plots each. The precision will therefore be one quarter that of the
factorial experiment, provided the standard error per plot is the same in both
cases. Even if these four experiments are combined and one set of plots is
used for the “ controls,” i.e. the plots receiving the standard level of each
factor, there will only (with 16 “ controls ”) be 16 plots for each factor, so that
the precision will be two-fifths that of the factorial design.
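
The arithmetic behind these comparisons can be checked with a short sketch. It assumes, as the text does, an equal standard error per plot in every layout, and it takes the precision of a comparison to be the reciprocal of its variance. The function name `information` is introduced here for illustration only.

```python
from fractions import Fraction

def information(plots_per_mean, sigma2=Fraction(1)):
    """Reciprocal variance of the difference of two means, each based on
    `plots_per_mean` plots with variance `sigma2` per plot (an assumption)."""
    variance = 2 * sigma2 / plots_per_mean
    return 1 / variance

factorial = information(40)  # 2 x 2 x 2 x 2 factorial on 80 plots: 40 v. 40
separate = information(10)   # four separate experiments of 20 plots: 10 v. 10
shared = information(16)     # one shared set of 16 controls: 16 v. 16

print(separate / factorial)  # relative precision of the separate experiments
print(shared / factorial)    # relative precision with shared controls
```

The two ratios are exact fractions, so they can be compared directly with the one quarter and two-fifths quoted for the separate and the combined single-factor experiments.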

If the effects of some or all of the factors vary with changes in the other 
factors, the factors are said to interact, and the estimates obtained as above
from a factorial experiment will be the average of the effects of each factor in 
conjunction with the different levels of the other factors. At the same time 
estimates of the actual amount of the variation may be obtained by taking the 
differences of the effects of one factor at the different levels of the other factors. 
In such circumstances the results of a set of experiments containing single 
factors only will be misleading to an extent depending on the degree of variation 
in the effects. 


1b. Criticisms of factorial design.

It is sometimes objected that what is really required is not the average
effect of a factor, but rather the effect of this factor in conjunction with some
particular combination of the remaining factors, and that factorial experiments
provide an estimate of this having only low precision. Actually it rarely happens
that agricultural practices are in fact standardized in the way contemplated by
the critics, but even where this is the case the objection, as we have seen, carries
no weight unless the variation in the effects is substantial, and even then the
loss of precision is small if the levels of the remaining factors finally adopted
lie between the extremes included in the experiment. In any case [...]
beforehand the particular combination of the other factors [...] waste of time
experimenting on them at all. [...] to survey the whole field, and the
experimenter who confines himself to [...] factors, making a guess at the final
levels of the other factors, is merely emulating the tactics of an ostrich.

[...] is that such and such a combination of factors would [...]. In fertilizer
trials it may be maintained that the application of [...] without potash or
nitrogen to a certain crop is ridiculous. Such notions are usually based on
entirely inadequate evidence and are [...] experimental test, but as evidence
accumulates the field of enquiry can sometimes profitably be narrowed. Thus if
it is known
that the application of some nitrogenous manure is certainly required, but the
optimal level is still in question, the lowest level of nitrogen need not be zero, 
but a minimal dressing. There is also no objection in randomized block 
experiments to including an additional set of plots (outside the main factorial 
scheme) receiving no nitrogen, both for demonstration purposes and as an 
assurance that conditions have not radically changed. 

There is one further point which must be considered in assessing the 
advantages of designs of varying complexity. As the number of treatment 
combinations is increased the adequate elimination of fertility differences becomes 
more difficult. Consequently the standard error per plot tends to be higher in
factorial designs than in simple experiments involving a few treatments only,
with a resultant lowering of the relative efficiency of factorial designs. The
whole matter has been discussed in (9)* where it was shown that the loss of
efficiency with properly designed experiments may be expected on the average
to be much less than the gain due to the use of factorial design, quite apart
from the information on the interactions between the different factors, which
can only be obtained from factorial designs. The loss of efficiency was found
to be due mainly to the necessity of abandoning Latin-square arrangements,
the discussion being written before it was realized that Latin-square designs
could be utilized in some types of factorial experiments. This procedure, when
it is possible, is likely to reduce the loss materially.

It is, perhaps, typical of the superficial character of most criticisms of
factorial design, that in many of them the efficiency of a design (i.e. relative 
amount of information per unit of work expended, or per plot when the work 
expended is proportional to the number of plots), is confused with the accuracy 
of the final comparisons, which accuracy can always be increased by increasing 
the size of the experiment, and therefore the number of replications, or by 
decreasing the number of treatments included in the experiment. 

The difficulties of the practical type that stand in the way of factorial design 
arise from the greater complexity both of the layout and the statistical analysis,
and the larger number of plots that are required. How far these are of import- 
ance must be decided by the man in charge of the field operations. In this 
connection it should be remembered that any new technique is liable to present 
difficulties which fade away on closer acquaintance. 


1c. Scope of the Communication.

In the present paper factorial designs with factors at two levels only are [...]

*The numbers refer to the references at the end of the paper.


No attempt has been made to give recommendations as to the best procedure 
in the field, or to discuss such points as size and shape of plot, number of 
replications, etc., since these depend so much on type of crop and local conditions
that no discussion in general terms would be profitable. It may be well to
emphasize here, however, that the additional complexity of factorial designs (and 
to a lesser extent all random arrangements) carries with it the necessity for careful 
organization if mistakes are to be avoided. The preparation of clear and simple 
plans, and a convenient system of numbering the fertilizer mixtures, etc., that 
are to be applied, will lighten the work of the man in the field, who is usually 
operating under adverse conditions, is frequently in a hurry, and is sometimes 
not very certain of the points at issue. Whenever the remark is heard, for 
instance, that random arrangements lead to mistakes in the field from which 
systematic arrangements are immune, it can be confidently predicted that the 
preliminary organization is inadequate. 

1d. New material.

For the benefit of the reader who is already familiar with the subject it may 
be well to indicate here what is new in this communication. Most important 
is the adaptation of confounding to Latin-square designs, so as to enable, for 
instance, a 2⁵ experiment to be arranged in the form of an 8 x 8 Latin square
(pp. 31-35, etc.). The analogous adaptation of split-plot designs is also of 
considerable importance (pp. 78-81). The parallel use of quasi-Latin squares 
(lattice squares) in varietal trials (described in full elsewhere) is also outlined 
(pp. 87-8). 

No complete account of the designs involving some factors at two and some 
at three levels (pp. 57-64) has previously been published, though some of these
designs have been in use at Rothamsted and elsewhere for some years. The
account of designs containing factors at two levels only (pp. 23-26) is also more
complete than any previously published. Lastly, the 3⁴ design in blocks of 9 plots
(pp. ^-8), a fairly obvious extension of the popular 3³ design, should be noted.

On the computational side a new method of computing the treatment effects 
in experiments with factors at two levels only is given (p. 15), and attention 
has been paid generally to the best methods of carrying out the computations 
of the various designs. 


1e. Notation, etc.

It is assumed that the reader is familiar with the methods of design and 
analysis appropriate to simple experiments in randomized blocks and Latin 
squares, and in particular that he is thoroughly conversant with the analysis 
of variance procedure applicable to experiments of this type. A selection of
references on the subject is given at the end of the paper.

The notation followed is substantially that of Fisher’s Design of Experiments.
Small letters are used to denote the treatments corresponding to the different
factors, and capital letters the main effects and interactions. The symbol [ab]
has been introduced to indicate the sum of all the yields corresponding to the
treatment combination ab, the symbol ab, when it indicates a number, being
used to represent the mean of these yields, or this mean expressed in standard
units (cwt. per acre, etc.). (In The Design of Experiments [ab] is used to denote
either the sum or the mean according to the experimental material.) By analogy
[A] and [A.B] are taken to represent the algebraic sums of all the plot yields
which go to make up the estimates of the main effect of a and the interaction
of a and b, without any division, whereas A and A.B indicate these estimates
expressed in terms of the yield of a single plot (or in standard units, such as
cwt. per acre), with the conventional factor ½, etc. introduced, as defined on
page 10. In the case of factors at more than two levels the symbol [A] is taken
to represent the whole set of totals, and A the whole set of means, corresponding
to the various levels a₁, a₂, . . . of the factor a.

One other new symbol is introduced. This is the word dev, which is 
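used to denote the deviations of a set of numbers from their mean. dev² is
likewise used to denote the sum of the squares of these deviations. Thus

\[
\begin{aligned}
\operatorname{dev}(a_1, a_2, \ldots) \equiv \operatorname{dev} a &= a_1 - \bar{a},\; a_2 - \bar{a},\; a_3 - \bar{a},\; \ldots \\
\operatorname{dev}^2(a_1, a_2, \ldots) \equiv \operatorname{dev}^2 a &= S(a - \bar{a})^2 = a_1^2 + a_2^2 + \ldots - n\bar{a}^2 \\
&= a_1^2 + a_2^2 + \ldots - \frac{1}{n}(a_1 + a_2 + \ldots)^2 \\
&= a_1^2 + a_2^2 + \ldots - \bar{a}(a_1 + a_2 + \ldots)
\end{aligned}
\]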

In a similar manner dev a. dev b might be used in covariance work to indicate 
the sum of the products of the deviations of two variables a and b. The occurrence 
of these quantities in statistical computation appears to be sufficiently frequent 
to justify the use of a special symbol, especially since they are only very clumsily 
representable by the current symbols when the quantities a are themselves complicated
algebraic expressions. 
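
As a minimal sketch of how these symbols translate into computation, the functions below take dev as the list of deviations from the mean and dev² as the sum of their squares. The numbers are made up purely for illustration, and the helper name `dev_product` (for the covariance-type quantity dev a . dev b) is an assumption of this sketch, not notation from the text.

```python
def dev(values):
    """Deviations of a set of numbers from their mean."""
    mean = sum(values) / len(values)
    return [v - mean for v in values]

def dev2(values):
    """Sum of the squares of the deviations from the mean."""
    return sum(x * x for x in dev(values))

def dev_product(a, b):
    """Sum of products of the deviations of two variables (hypothetical name)."""
    return sum(x * y for x, y in zip(dev(a), dev(b)))

a = [3, 5, 7, 9]           # illustrative values only
n = len(a)
total = sum(a)
mean = total / n

direct = dev2(a)                                        # S(a - mean)^2
from_squares = sum(v * v for v in a) - total * total / n
from_mean = sum(v * v for v in a) - mean * total

print(direct, from_squares, from_mean)
```

The last form, the sum of squares less the mean multiplied by the total, is usually the most convenient when the total and mean are already to hand.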

Algebraic formulae have been avoided as far as possible, and where it has 
been necessary to introduce them particular attention has been paid to writing 
them in the form required by the computer and also in a form exhibiting their 
structure, so that they are easily remembered. Thus the quantity Q on page 50 
has been so defined as to be analogous with the quantity [B.C], but the formula 
for 3Q is given because 3Q will be computed in numerical work. The formula
for B.C on the same page is given in terms of both Q and 3Q, the latter being
the form required for computation, while the former exhibits the structure. 

Free use has, however, been made of the algebraic notation of signs, brackets, 
etc., in setting out arithmetical calculations. Those who can understand this
notation (as for example the expression for the sum of squares on page 37)
should have no difficulty with the algebraic formulae.


2. A SIMPLE FACTORIAL EXPERIMENT ON POTATOES. 

The main features of factorial designs involving only two levels (often 
presence and absence) of each factor can best be illustrated by a simple example.
We have chosen an experiment on the manuring of potatoes carried out at
Wimblington in 1934.

Three factors, nitrogen, potash and dung, were included ; the 8 treatment
combinations consisted of all combinations of :

    Sulphate of Ammonia (n)     None     0.45 cwt. N per acre
    Sulphate of Potash (k)      None     1.12 cwt. K₂O per acre
    Dung (d)                    None     8 tons per acre

There were four replications in randomized blocks of 1/60 acre plots. The plan
of the experiment and the yields of the individual plots are shown in Table 1.



Table 1. Plan and yields in lb.

              Block I                          Block II

      nk     kd     d      nd          kd     d      k      nk
      291    398    312    373         407    324    272    306

      (1)    k      n      nkd         n      nkd    nd     (1)
      101    265    106    450         89     449    338    106

              Block III                        Block IV

      d      (1)    nd     kd          nd     nk     n      d
      323    87     324    423         361    272    103    324

      nk     k      n      nkd         k      (1)    nkd    kd
      334    279    128    471         302    131    437    445

    Block Totals :   I 2296    II 2291    III 2369    IV 2375    Total 9331


2a. Yields of the different combinations of treatments. 

The first step in the analysis of the results of a factorial experiment is to
calculate the total yields of all the plots with each combination of treatments.
The main features of the results are usually apparent from an inspection of these
totals. An analysis of variance will, however, be necessary in order to make
the formal tests of significance and assign standard errors to the various
comparisons.

The yields of the individual treatment combinations in this experiment 
(converted to tons per acre) are given in Table 2 . 


Table 2. Yields of the different combinations of treatments
(tons per acre).

    (1)     n       k       nk      d       nd      kd      nkd      Mean
    2.84    2.85    7.49    8.06    8.59    9.35    11.20   12.10    7.81
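
The step from the plot yields to the yields per acre can be sketched as follows. The plot values are those of Table 1; a few entries are hard to read in the copy at hand and are taken here as the values that reproduce the printed block totals, so they should be treated as assumptions. The conversion assumes 2240 lb. to the ton and four plots of 1/60 acre for each treatment combination.

```python
# Plot yields in lb., one dictionary per block (values as read from Table 1;
# nd in Block II, nd and nkd in Block III and n in Block IV are inferred
# from the block totals).
blocks = {
    "I":   {"nk": 291, "kd": 398, "d": 312, "nd": 373,
            "(1)": 101, "k": 265, "n": 106, "nkd": 450},
    "II":  {"kd": 407, "d": 324, "k": 272, "nk": 306,
            "n": 89, "nkd": 449, "nd": 338, "(1)": 106},
    "III": {"d": 323, "(1)": 87, "nd": 324, "kd": 423,
            "nk": 334, "k": 279, "n": 128, "nkd": 471},
    "IV":  {"nd": 361, "nk": 272, "n": 103, "d": 324,
            "k": 302, "(1)": 131, "nkd": 437, "kd": 445},
}

LB_PER_TON = 2240
PLOTS_PER_ACRE = 60
ORDER = ["(1)", "n", "k", "nk", "d", "nd", "kd", "nkd"]

block_totals = {name: sum(plots.values()) for name, plots in blocks.items()}
treatment_totals = {t: sum(plots[t] for plots in blocks.values()) for t in ORDER}

# Mean yield per plot, scaled up to an acre and converted from lb. to tons.
factor = PLOTS_PER_ACRE / (len(blocks) * LB_PER_TON)
tons_per_acre = {t: treatment_totals[t] * factor for t in ORDER}

for t in ORDER:
    print(f"{t:>4}  total {treatment_totals[t]:5d} lb.  {tons_per_acre[t]:6.2f} tons per acre")
```

The converted values agree with Table 2 to within a unit in the second decimal place.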








2b. Main effects. 

Consider first the effect of dung. There are four relevant comparisons
in Table 2.

    Response to dung
        n and k absent          d - (1)     =  5.75
        n absent, k present     kd - k      =  3.71
        n present, k absent     nd - n      =  6.50
        n and k present         nkd - nk    =  4.04

        Mean response = D = 5.00


These large apparent responses are sufficiently consistent to indicate that they 
are unlikely to be due to experimental error. 

The mean response, 5.00, will be called the main effect of dung, and will 
be denoted by the capital letter D. 

In a similar manner we have 


    Response to potash
        n and d absent          =  4.65
        n absent, d present     =  2.61
        n present, d absent     =  5.21
        n and d present         =  2.75

        Mean response = K = 3.80


    Response to nitrogen
        k and d absent          =  0.01
        k absent, d present     =  0.76
        k present, d absent     =  0.57
        k and d present         =  0.90

        Mean response = N = 0.56

There is, therefore, also evidence of a substantial response to potash, and possibly 
a small response to nitrogen. 
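
The four comparisons for each factor, and their mean, can be reproduced from the yields of Table 2 with a short sketch. The helper names `label` and `simple_responses` are introduced here for illustration; they are not part of the original notation.

```python
from itertools import product
from statistics import mean

# Mean yields in tons per acre, as given in Table 2.
yields = {"(1)": 2.84, "n": 2.85, "k": 7.49, "nk": 8.06,
          "d": 8.59, "nd": 9.35, "kd": 11.20, "nkd": 12.10}

ORDER = "nkd"  # order in which the letters appear in a treatment label

def label(letters):
    """Treatment label for the set of factors present, e.g. {'d', 'n'} -> 'nd'."""
    return "".join(sorted(letters, key=ORDER.index)) or "(1)"

def simple_responses(factor):
    """Response to `factor` at each combination of the other two factors."""
    others = [f for f in ORDER if f != factor]
    responses = []
    for present in product([False, True], repeat=len(others)):
        base = {f for f, p in zip(others, present) if p}
        responses.append(yields[label(base | {factor})] - yields[label(base)])
    return responses

for factor in "dkn":
    responses = simple_responses(factor)
    print(factor.upper(), [f"{r:.2f}" for r in responses], f"mean {mean(responses):.2f}")
```

The responses are listed in the same order as in the text: both other factors absent, then each present in turn, then both present.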


2c. Interactions.

Examining the individual responses further, we see that the two responses
to dung with potash absent are both substantially larger than the responses with
potash present. Equally the responses to potash are substantially larger in the
absence than in the presence of dung. The presence or absence of nitrogen,
however, makes little difference in either case.

The numerical differences in response to dung in the presence and absence
of potash are as follows :

    Difference in response to d         n absent    = - 2.04
    in presence and absence of k        n present   = - 2.46

                                        Mean        = - 2.25

For reasons that will be apparent in a moment it is convenient to take
half this mean difference, namely - 1.12. This is defined as the interaction between
the two factors* d and k, and may be written D x K, D.K or DK.

*Also called the first-order interaction. The mean interaction over all the other factors in the experiment is
implied unless the contrary is stated.

A similar computation for differences in the responses to potash in the
presence and absence of dung gives the identical results :

    Difference in response to k         n absent    = - 2.04
    in presence and absence of d        n present   = - 2.46

                                        Mean        = - 2.25

A moment's consideration will show that this must be so. Thus the interaction
between dung and potash is identical with the interaction between potash and 
dung. 

An alternative method of setting out the main effects and interactions 
between two factors is by means of a two-way table. In this example, taking 
the mean of n and no n in each case, we have the values of Table 3. 


Table 3. Mean of n and no n (tons per acre).

                No d       d        Mean
    No k        2.84      8.97      5.90
    k           7.78     11.65      9.72
    Mean        5.31     10.31      7.81

The main effects are given by the differences between the pairs of marginal
means, and the interaction is given by the difference of the means of the two
diagonals, i.e. by ½ (2.84 + 11.65 - 8.97 - 7.78).

In a similar manner, the difference between the values - 2.46 and - 2.04
gives an estimate of the change in the interaction between potash and dung in
the presence and absence of nitrogen. One quarter of this difference, i.e. one
half the difference of the interactions, is defined as the interaction between the
three factors* and may be written N x K x D, N.K.D or NKD.
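
The two-way table and the interactions derived from it can be sketched as below, starting from the yields of Table 2. The function names `cell` and `kd_at` are hypothetical helpers for this illustration. Because the sketch works from unrounded means, its results may differ from the printed figures by a unit in the last decimal place.

```python
# Mean yields in tons per acre, as given in Table 2.
yields = {"(1)": 2.84, "n": 2.85, "k": 7.49, "nk": 8.06,
          "d": 8.59, "nd": 9.35, "kd": 11.20, "nkd": 12.10}

def cell(k, d):
    """Mean of n and no n for a given presence (True) or absence of k and d."""
    base = ("k" if k else "") + ("d" if d else "")
    return (yields[base or "(1)"] + yields["n" + base]) / 2

# Two-way table of k against d, averaged over nitrogen.
table3 = {(k, d): cell(k, d) for k in (False, True) for d in (False, True)}

# Interaction K.D: difference of the means of the two diagonals.
kd_interaction = (table3[False, False] + table3[True, True]
                  - table3[False, True] - table3[True, False]) / 2

def kd_at(n):
    """Interaction of k and d at one level of nitrogen: half the difference
    of the responses to d with and without k."""
    p = "n" if n else ""
    response_with_k = yields[p + "kd"] - yields[p + "k"]
    response_without_k = yields[p + "d"] - yields[p or "(1)"]
    return (response_with_k - response_without_k) / 2

# Three-factor interaction: half the difference of the two K.D interactions.
nkd_interaction = (kd_at(True) - kd_at(False)) / 2

print(f"K.D   = {kd_interaction:+.3f}")
print(f"N.K.D = {nkd_interaction:+.3f}")
```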

2d. Calculation of the main effects and interactions from the experimental yields. 

It will readily be seen from the above remarks that the main effects and 
interactions may all be obtained by subtracting the mean of 4 of the yield
values of the individual treatment combinations from the mean of the other 
4, or alternatively by taking the sum of 4 of these values less the sum of the 
other 4, and dividing the result by 4. The actual signs are given in Table 4. 


Table 4. Main effects and interactions in a three-factor experiment. 





These signs may be derived in various ways. The simplest is to write
down the signs for the three main effects, and then to form the interactions
between each pair of main effects by writing + for two +’s or two -’s, and - for
a - and a + . A further application of this process gives the interaction between 
the three factors. If there are more than three factors the table may be extended 
by still further applications of the same rule. 
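
The rule just described lends itself to a mechanical sketch: write the sign of each main effect as + where the factor is present and - where it is absent, and multiply signs to form each interaction. The function names, and the layout of the printed table, are assumptions of this sketch and are not claimed to reproduce the printed Table 4.

```python
FACTORS = "nkd"

def treatment_combinations(factors=FACTORS):
    """Treatment combinations in the order (1), n, k, nk, d, nd, kd, nkd."""
    combos = [""]
    for f in factors:
        combos += [c + f for c in combos]
    return combos

def effect_signs(effect, combos):
    """Signs for a main effect or interaction, e.g. effect='kd' for K.D.
    Each factor contributes + if present in the combination and - if absent;
    the sign for an interaction is the product of its factors' signs."""
    signs = []
    for combo in combos:
        sign = 1
        for f in effect:
            sign *= 1 if f in combo else -1
        signs.append(sign)
    return signs

def sign_table(factors=FACTORS):
    combos = treatment_combinations(factors)
    effects = [c for c in combos if c]  # n, k, nk, d, nd, kd, nkd
    return {e: effect_signs(e, combos) for e in effects}, combos

table, combos = sign_table()
print("        " + " ".join(f"{(c or '(1)'):>4}" for c in combos))
for effect, signs in table.items():
    name = ".".join(effect.upper())
    print(f"{name:<8}" + " ".join(f"{'+' if s > 0 else '-':>4}" for s in signs))
```

Passing a longer string of letters to `sign_table` extends the table to more than three factors by the same rule.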

The following formal expressions for the interactions are also worth noting : 

\[
\begin{aligned}
N &= \tfrac{1}{4}(n - 1)(k + 1)(d + 1), \\
N.K &= \tfrac{1}{4}(n - 1)(k - 1)(d + 1), \\
N.K.D &= \tfrac{1}{4}(n - 1)(k - 1)(d - 1).
\end{aligned}
\]

If these expressions are expanded by the ordinary rules of algebra the appropriate
expressions for the main effects and interactions in terms of the treatment
combinations will be obtained. With four factors the fraction will be 1/8, with
five 1/16, etc., and with only two factors 1/2.

If the above method of calculation be applied to our example the main 
effects and interactions will be found to have the values given in Table 5. 
Some of these have been obtained already. 


Table 5. Main effects and interactions.

    N = + 0.56      N.K = + 0.18
    K = + 3.80      N.D = + 0.27      N.K.D = - 0.10
    D = + 5.00      K.D = - 1.12

A more mechanical method of obtaining these values is given in the next section. 
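
The direct calculation described above, one quarter of the signed sum of the eight yields, can be sketched as follows. This is the plain sign-table method, not the more mechanical method of the next section, and the function name `effect` is an assumption of the sketch.

```python
# Mean yields in tons per acre, as given in Table 2.
yields = {"(1)": 2.84, "n": 2.85, "k": 7.49, "nk": 8.06,
          "d": 8.59, "nd": 9.35, "kd": 11.20, "nkd": 12.10}

def effect(letters):
    """Main effect or interaction for the given factors, e.g. 'kd' for K.D:
    one quarter of the sum of the yields taken with the appropriate signs."""
    total = 0.0
    for combo, value in yields.items():
        sign = 1
        for f in letters:
            sign *= 1 if f in combo else -1
        total += sign * value
    return total / 4

effects = {e: effect(e) for e in ["n", "k", "nk", "d", "nd", "kd", "nkd"]}
for name, value in effects.items():
    print(f"{'.'.join(name.upper()):<6} = {value:+.3f}")
```

The values agree with Table 5 apart from the rounding of the last decimal place.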

These values clearly all have the same standard error, since they are each
one quarter of the sums and differences of the yields of the eight treatment
combinations. As we shall show presently, the estimate of this standard error
(21 degrees of freedom) is ± 0.177. Any effect which is more than twice its standard
error may be judged significant. Thus all three main effects and the interaction
between potash and dung, the two factors producing the large effects, are
significant. As is commonly found in agricultural trials, factors
which produce large main effects may show evidence of interactions, while factors
which produce small main effects usually show no interactions.
A little consideration will show that this is what may be expected on general
grounds. The interactions are in general likely to be small in comparison
with the corresponding main effects.
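
The rough rule of twice the standard error can be applied to the values of Table 5 in a few lines. The standard error of 0.177 is taken from the text; its derivation belongs to the analysis of variance and is not reproduced in this sketch.

```python
STANDARD_ERROR = 0.177  # tons per acre, 21 degrees of freedom (from the text)

# Main effects and interactions as printed in Table 5.
table5 = {"N": 0.56, "K": 3.80, "D": 5.00,
          "N.K": 0.18, "N.D": 0.27, "K.D": -1.12, "N.K.D": -0.10}

ratios = {name: value / STANDARD_ERROR for name, value in table5.items()}
significant = {name for name, ratio in ratios.items() if abs(ratio) > 2}

for name, ratio in ratios.items():
    verdict = "significant" if name in significant else "not significant"
    print(f"{name:<6} {table5[name]:+.2f}  ratio {ratio:+6.2f}  {verdict}")
```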

2e. Interpretation of main effects and interactions.

It will be clear from what has already been written that the whole set of
main effects and interactions, together with the mean yield, is equivalent to
the yields of the individual treatment combinations.
The response to any factor or combination of factors in the presence or
absence of any other factor or factors (the mean being taken over all factors
not under consideration) can be written down very simply in terms of the main
effects and interactions. The rules will be obvious from the study of Table 6.


Table 6. Responses in terms of main effects and interactions.

                                                   Expression in terms of
    Response to :                           treatment combinations    main effects and interactions

    d (k absent)      }                     ½[nd - n + d - (1)]       D - K.D
    d (k present)     } (mean of n          ½[nkd - nk + kd - k]      D + K.D
    d and k together  }  and no n)          ½[nkd - n + kd - (1)]     D + K
    d (n and k absent)                      d - (1)                   D - N.D - K.D + N.K.D
    d and k (n absent)                      kd - (1)                  D + K - N.D - N.K
    d, k and n                              nkd - (1)                 D + K + N + N.K.D


The interactions may thus be regarded as correcting terms which adjust 
the values of the main effects (which would be additive if the interactions were 
all zero). In this example the response to d where k is absent (mean of n and
no n) is

    D - K.D = + 5.00 + 1.12 = + 6.12

and where k is present is

    D + K.D = + 5.00 - 1.12 = + 3.88

The response to both d and k (mean of n and no n) is 

D + K = + 5.00 + 3.80 = + 8.80 

These responses are those given by the differences of the values of Table 3. 

It should be particularly noted that the interaction between d and k does 
not enter into the latter response. In the same way only the three-factor
interaction enters into the expression for the simultaneous response to all three
fertilizers :

    N + K + D + N.K.D = + 0.56 + 3.80 + 5.00 - 0.10 = + 9.26

(This response can be obtained from Table 2.) If the interactions between 
the three factors were ignored, therefore, the estimate would be 

D+ K + N= + 9.36. 
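
The responses discussed above can be written out from the values of Table 5 as a quick check. The dictionary `E` and the variable names are introduced for this sketch only.

```python
# Main effects and interactions as printed in Table 5 (tons per acre).
E = {"N": 0.56, "K": 3.80, "D": 5.00,
     "N.K": 0.18, "N.D": 0.27, "K.D": -1.12, "N.K.D": -0.10}

# Responses averaged over n and no n.
d_without_k = E["D"] - E["K.D"]
d_with_k = E["D"] + E["K.D"]
d_and_k_together = E["D"] + E["K"]

# Responses measured from the untreated plots.
d_alone = E["D"] - E["N.D"] - E["K.D"] + E["N.K.D"]
d_and_k_without_n = E["D"] + E["K"] - E["N.D"] - E["N.K"]
all_three = E["N"] + E["K"] + E["D"] + E["N.K.D"]
all_three_ignoring_interaction = E["N"] + E["K"] + E["D"]

print(f"d, k absent          {d_without_k:+.2f}")
print(f"d, k present         {d_with_k:+.2f}")
print(f"d and k together     {d_and_k_together:+.2f}")
print(f"d alone              {d_alone:+.2f}")
print(f"d and k, n absent    {d_and_k_without_n:+.2f}")
print(f"d, k and n           {all_three:+.2f}")
```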

The yield of any treatment combination may also be obtained from the 
main effects and interactions, together with the mean yield, being equal to the
mean yield and the sum of plus or minus one half of all the main effects and
interactions. The signs are given by Table 4. Thus, for example :

    kd = mean + ½{- N + K - N.K + D - N.D + K.D - N.K.D}

It will be noted that in the order shown the table is symmetrical about the
diagonal through the top right-hand corner, so that the expression for kd
(equivalent to n absent) is obtained from that of N by replacing (1) by the
mean, n by ½N, etc., and changing signs if the sign of (1) is negative.
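
That the mean and the seven effects are together equivalent to the eight yields can be verified by a round trip: compute the effects from the yields, then rebuild each yield as the mean plus or minus one half of every effect. The function names are assumptions of this sketch; the yields are those of Table 2.

```python
# Mean yields in tons per acre, as given in Table 2.
yields = {"(1)": 2.84, "n": 2.85, "k": 7.49, "nk": 8.06,
          "d": 8.59, "nd": 9.35, "kd": 11.20, "nkd": 12.10}
EFFECTS = ["n", "k", "nk", "d", "nd", "kd", "nkd"]

def sign(effect, combo):
    """Product of +1 (factor present) or -1 (factor absent) over the factors
    of the effect."""
    s = 1
    for f in effect:
        s *= 1 if f in combo else -1
    return s

def effect_value(effect):
    """One quarter of the signed sum of the eight yields."""
    return sum(sign(effect, c) * y for c, y in yields.items()) / 4

def yield_from_effects(combo, mean_yield, effects):
    """Mean yield plus or minus one half of each main effect and interaction."""
    return mean_yield + sum(sign(e, combo) * v for e, v in effects.items()) / 2

mean_yield = sum(yields.values()) / len(yields)
effects = {e: effect_value(e) for e in EFFECTS}

rebuilt = {c: yield_from_effects(c, mean_yield, effects) for c in yields}
for combo in yields:
    print(f"{combo:>4}  observed {yields[combo]:6.2f}  rebuilt {rebuilt[combo]:6.2f}")
```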


2f. General remarks.

The formal statement of the results in terms of main effects and interactions thus
provides a means of summarizing a factorial experiment, and concentrating
attention on its main features. It should not be forgotten, however, that [...],
the interactions being measures of the departure of the observed
results from the law implied in the definition of the main effects. Here
the main effects are so defined as to imply an additive law between the effects
due to the three factors. This is statistically convenient, and in agriculture
appears to provide a good representation of the type of effect frequently observed. 
But it should be clearly understood that the additive law has been provisionally 
imposed by the statistician and is not implicit in the data. 

The present example has itself afforded an illustration of a simple type 
of departure from the additive law. Others more complex will occasionally 
arise, and the experimenter should then bear in mind that the formal presentation 
of the results in terms of main effects and interactions may not necessarily be
the best course to pursue. Equally, however, he should avoid giving exaggerated
emphasis to some statistically significant but isolated high order interaction
which has no apparent physical meaning. If we are using the 1 in 20 level
of significance one out of every twenty of the main effects and interactions will 
on the average be judged statistically significant even when the treatments 
produce no effects at all. Such anomalous results, therefore, together with 
non-significant effects, should be placed on record and judgment reserved until 
further information has accumulated. 

Conversely, a verdict of non-significance does not imply that no effect exists. 
It merely implies that the observed apparent effect would arise more frequently 
than 1 in 20 (or 1 in 100) times by chance if there were no real effect. The
application of exact tests of significance to all experimental results is a salutary 
habit which discourages the discussion of small apparent differences whose 
magnitude is very ill determined, but it should not be forgotten that the main 
object of most agricultural field trials is to estimate as accurately as possible 
effects of which the experimenter is normally quite prepared to admit the 
existence. A secondary requirement is the determination of the magnitude of 
the errors to which these estimates are subject, thus fixing limits between 
which the true value of the effect is likely to lie. Consequently tests of significance
are replaced by estimates of standard errors and fiducial probability. 

Thus, for example, it is reasonable to suppose that the application of
nitrogenous fertilizer to a crop on a given area will always alter the yield of that
crop, although the alteration may in certain cases be very small. Non-significant
results must not be taken as implying that no effect exists in such experiments,
though they can be taken as implying that the effect lies within certain limits.
In conjunction with other results, also not in themselves significant, they may
show quite clearly the existence of a small, but appreciable, effect. The
practice of finding the average response to a fertilizer at stations at which the
response is significant is meaningless, for by making this selection
we automatically select a majority of stations at which the error in the estimated
response is positive.


3. Statistical analysis of a 2x2x2 experiment. 

The discussion in the last section was designed to illustrate the various
aspects of the results of a simple factorial design. The routine analysis of such
an experiment is, of course, much abbreviated, and in the present section we
propose to give an outline of the various steps which are necessary in
order to arrive at these results expeditiously and without unnecessary repetition
of the various calculations.


1. Yields of plots. Set out the yields as in Table 1, rounding off, if
necessary, to three significant figures. (See note 1, p. 91).

2. Totals of individual treatment combinations and block totals. These are 
shown in the yield column of Table 7 and in Table 1 respectively. 

3. Calculation of main effects and interactions in terms of the totals of
the individual treatment combinations. The main effects and interactions can be 
calculated from the totals of the individual treatment combinations by means 
of the table of signs in the last section. No division of the resultant totals need 
be carried out. These totals are shown in column (3) of Table 7, each being 
the sum of 16 plot yields less the sum of the other 16. 

A more systematic and shorter method, which avoids the trouble of picking
out the relevant treatment combinations (a process which is laborious when there
are a large number of factors) is that shown in Table 7.

The yields must first be arranged in a standard order of the type shown, 
each factor being introduced in turn, and being followed by all combinations 
of itself and the factors previously introduced. Thus the last four combinations 
are formed by taking d in conjunction with the first four combinations. 

Column (1) is then formed. The first four numbers are the sums of the
four pairs of numbers in the yield column, and the last four numbers are the
differences of these pairs, the upper number being subtracted from the lower in
each case. Thus 2321 = 1118 + 1203 and +85 = -1118 + 1203. Column (2) is
formed in the same manner from column (1), and column (3) from column (2).
Since there are three factors these three applications of the process complete the
calculation. The total, and the main-effect and interaction totals, are obtained
in column (3), each effect and interaction appearing opposite the corresponding
small letters in the first column.


Table 7. Calculation of treatment effects.

Treatment     Yield      (1)      (2)       (3)       Effect
(1)             425      851     3172      9331       Total
n               426     2321     6159      +333**     N
k              1118     2679      +86     +2271**     K
nk             1203     3480     +247      +105       N.K
d              1283       +1    +1470     +2987**     D
nd             1396      +85     +801      +161       N.D
kd             1673     +113      +84      -669**     K.D
nkd            1807     +134      +21       -63       N.K.D

S.E.          ±37.2                       ±105.4
Conversion    60/(2240 × 4)               60/(2240 × 16)
factor        = .00669643                 = .00167411

Significance levels (column 3): 5%: 219; 1%: 298.

Asterisks denote significant results at 1% level.
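
The sum-and-difference process of Table 7 can be sketched in a few lines of Python. This is only an illustrative sketch: the function name `yates` and the list layout are assumptions of the example, not part of the original text. The yields must be supplied in the standard order described above.

```python
def yates(totals):
    """Apply the sum-and-difference process once per factor.

    `totals` must be in standard order, e.g. (1), n, k, nk, d, nd, kd, nkd.
    Returns the grand total followed by the effect totals in the same order.
    """
    n = len(totals)
    if n == 0 or n & (n - 1):
        raise ValueError("number of treatment combinations must be a power of 2")
    column = list(totals)
    for _ in range(n.bit_length() - 1):      # one application per factor
        pairs = [(column[i], column[i + 1]) for i in range(0, n, 2)]
        sums = [upper + lower for upper, lower in pairs]
        diffs = [lower - upper for upper, lower in pairs]   # upper subtracted from lower
        column = sums + diffs
    return column


yields = [425, 426, 1118, 1203, 1283, 1396, 1673, 1807]    # Table 7, yield column
labels = ["Total", "N", "K", "N.K", "D", "N.D", "K.D", "N.K.D"]
effects = dict(zip(labels, yates(yields)))
print(effects)
```

Each pass corresponds to one of the columns (1), (2), (3); after the third pass the list holds column (3) of Table 7.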




There are no very simple checks on the intermediate stages of the
calculation. Complete accuracy should therefore be aimed at, particular
attention being paid to signs.* The sum-of-squares check, described below,
controls the whole calculation except for the signs of the last column, which
should be independently checked. Interchanges in the yield column must
be avoided by systematic computation. A useful partial check is provided by
the sum, which is independently obtained from the block totals. This and
the independent calculation of the interaction between all factors check all
of the yield column and column (1), and one half of column (2).

A more elaborate example of the method, involving 5 factors, is shown 
in Table 22, where a systematic check for each column is introduced. 

4. Calculation of sums of squares for blocks, treatments, and error. The 
ordinary methods of the analysis of variance are followed. These give the 
analysis of Table 8. It is advisable to record the correction for the mean as 
this is often required in subsequent calculations. 

Table 8. Analysis of variance.

                        D.F.   Sum of squares   Mean square
Correction for mean      ..       2720861.3
Blocks                    3           774.1         258.0
Treatments                7        458718.0
Error                    21          7287.6         347.0
Total                    31        466779.7

5. Partition of the treatment degrees of freedom and sum of squares. The 
7 degrees of freedom for treatments can be divided into 7 single degrees of 
freedom representing main effects, interactions between two factors, and the 
interaction between all three factors. The seven sums of squares may be 
calculated by squaring the quantities of Table 7, column (3). They are shown
in Table 9. 


Table 9. Partition of treatment sum of squares.

            D.F.   Sum of squares
N            1          3465.3
K            1        161170.0
N.K          1           344.5
D            1        278817.8
N.D          1           810.0
K.D          1         13986.3
N.K.D        1           124.0
Total        7        458717.9


Each square must be divided by 32, since it is the square of the total of
±1 times the yields of each of 32 plots. (See note 2.) Thus

161170.0 = 2271²/32


*The sum of two numbers of the same sign is the arithmetic sum and has itself this sign. The
sum of two numbers of opposite signs is the arithmetic difference and has the sign of the larger. To
find the difference of two numbers change the sign of the number to be subtracted and take the sum as above.
Examples will be found in Table 22.


These 7 degrees of freedom are orthogonal and therefore the sum of the 
7 sums of squares is equal to the ordinary treatment sum of squares. (See 
note 3.) This provides the check of Table 7 mentioned above, and also checks 
the treatment sum of squares and the correction for the mean in Table 8.
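
As an illustration of this check, the sketch below squares each total of column (3), divides by 32, and sums the seven components. The dictionary layout and variable names are assumptions of the example; the figures are those of Tables 7 and 9.

```python
# Effect totals from column (3) of Table 7 (the grand total is excluded).
effect_totals = {"N": 333, "K": 2271, "N.K": 105, "D": 2987,
                 "N.D": 161, "K.D": -669, "N.K.D": -63}
PLOTS = 32   # each total is the sum of +1 or -1 times each of the 32 plot yields

sums_of_squares = {name: total ** 2 / PLOTS for name, total in effect_totals.items()}
treatment_ss = sum(sums_of_squares.values())

for name, ss in sums_of_squares.items():
    print(f"{name:6s} {ss:10.1f}")
print(f"{'Total':6s} {treatment_ss:10.1f}")
```

The unrounded total agrees with the treatment sum of squares of Table 8; Table 9 shows 458717.9 only because its entries were rounded before adding.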

Since the tests of significance can be performed by the t test (as described
below) there is in practice no need to write down the separate sums of squares 
for each main effect and interaction, and Table 9 will consequently be omitted. 
All that is necessary is to sum the squares of column (3) of Table 7 (excluding 
the sum) on the machine, and divide the result by 32. 

6. Calculation of mean squares and tests of significance. The separate
components of the sum of squares for treatments can be tested for significance
by means of the z test. Since in this case each corresponds to a single degree
of freedom, however, it is simpler to use the t test, which is equivalent to the
z test for n₁ = 1.

Since there are seven separate effects to be tested it is worth calculating
the 5% and 1% points. For 21 degrees of freedom t = 2.080 for the 5% point
and 2.831 for the 1% point. The estimate of the standard error of a main-effect
or interaction total is √(32 × 347.0) = 105.4. The 5% and 1% significance levels
for the main-effect and interaction totals are therefore 105.4 × 2.080 = 219.2 and
105.4 × 2.831 = 298.4. Thus we see immediately that N, K, D, and K.D all
attain the 1% level of significance, the remaining interactions not being
significant.
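
The comparison of each total with the two significance levels can be sketched as follows. The t values are those quoted above for 21 degrees of freedom; the function `verdict` and the variable names are assumptions of the example.

```python
import math

ERROR_MEAN_SQUARE = 347.0                  # Table 8, 21 degrees of freedom
PLOTS = 32                                 # every effect total involves all 32 plots
T_5_PERCENT, T_1_PERCENT = 2.080, 2.831    # t for 21 d.f., as quoted in the text

effect_totals = {"N": 333, "K": 2271, "N.K": 105, "D": 2987,
                 "N.D": 161, "K.D": -669, "N.K.D": -63}

standard_error = math.sqrt(PLOTS * ERROR_MEAN_SQUARE)
level_5 = T_5_PERCENT * standard_error
level_1 = T_1_PERCENT * standard_error


def verdict(total):
    """Classify an effect total against the 5% and 1% levels."""
    size = abs(total)
    if size >= level_1:
        return "1%"
    if size >= level_5:
        return "5%"
    return "not significant"


verdicts = {name: verdict(total) for name, total in effect_totals.items()}
print(round(standard_error, 1), round(level_5), round(level_1))
print(verdicts)
```

Working with the unrounded standard error gives levels that agree with the printed 219 and 298 to the nearest unit.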

7. Conversion of yields and presentation of the results. The yields should 
be converted to the customary agricultural units, and the results presented in 
the form most suitable for making clear the main features, and for combination 
with results of other experiments. Many alternative forms are possible, and 
the exact form will depend largely on circumstances. In general it is a good 
rule to present tables showing all main effects and interactions between two
factors (and also any interactions between three or more factors which appear
to be of interest) either directly or in the form of two-way tables with marginal
totals. Various examples of the different types of presentation will be found
in this communication. 


The results of the present experiment have already been discussed in detail
in the previous section. Both Tables 2 and 5 can be derived directly from the
conversion of the appropriate columns of Table 7. Notice that the conversion
factor for column (3) of Table 7 is that applicable to the totals of 16 plots,
although the totals themselves involve 32 plots, i.e. the difference of two sums
of 16 plots. The appropriate standard errors should be written in Table 7 and converted
at the same time as the numbers to which they refer.
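
A short sketch of the conversion, under the assumption that the factor has exactly the form 60/(2240 × number of plots) printed in Table 7. The constants 60 and 2240 are taken as given from that table, and the function name is hypothetical.

```python
def conversion_factor(plots_in_total):
    """Factor turning a total over `plots_in_total` plots into tons per acre."""
    return 60 / (2240 * plots_in_total)


# Column (3) holds differences of two sums of 16 plots, so the 16-plot factor applies.
factor = conversion_factor(16)
effect_totals = {"K": 2271, "D": 2987, "K.D": -669}
effects_tons_per_acre = {name: total * factor for name, total in effect_totals.items()}
standard_error_tons_per_acre = 105.4 * factor    # converted with the same factor

print({name: round(value, 2) for name, value in effects_tons_per_acre.items()})
print(round(standard_error_tons_per_acre, 3))
```

The converted values for K, D and K.D are the ones that enter Table 10 below.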

8. Calculation of the yields of individual treatment combinations from the main
effects and interactions. A procedure similar to that adopted for the calculation
of the main effects and interactions from the yields of the individual treatment
combinations is available for the reverse computation; this procedure is
particularly useful in experiments involving a large number of factors when a
table giving the mean yields of the combinations of two or three factors averaged
over the remaining factors is required. It can also be used to reconstruct the 
yields of the individual treatment combinations, when these latter are not 
available. 

As an example we have derived Table 3 from Table 5 and the mean yield. 
The computations are shown in Table 10. Since only the two factors k and d 
are involved effects involving n will not enter into the calculation. 

K, D, K.D and twice the mean yield are arranged in a column, the order
being the same as that adopted in Table 7, but beginning from the bottom. The 
computation process used in Table 7 is now applied. Only two applications are 
necessary as only two factors are involved. The last column is divided by 2 to 
give the required mean yields. 


Table 10. Calculation of yields of treatment combinations
from main effects and interactions.

Effect                  (1)      (2)     Yield
K.D          -1.12     +2.68    23.30    11.65    kd  }
K            +3.80     20.62    15.54     7.77    k   }  Mean over
D            +5.00     +4.92    17.94     8.97    d   }  n and no n
2 (Mean)     15.62     10.62     5.70     2.85    (1) }


As an exercise in the more extended application of this process the yields 
of Table 2 may be derived, using all the effects of Table 5. 
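
The reverse computation of Table 10 can be sketched with the same sum-and-difference step used for Table 7. The function names are assumptions of the example, and the input order is the one printed in Table 10.

```python
def sum_and_difference(column):
    """One application of the Table 7 process: pair sums, then lower minus upper."""
    pairs = [(column[i], column[i + 1]) for i in range(0, len(column), 2)]
    return [u + l for u, l in pairs] + [l - u for u, l in pairs]


def yields_from_effects(column, factors):
    """Apply the process once per factor, then halve the last column."""
    for _ in range(factors):
        column = sum_and_difference(column)
    return [value / 2 for value in column]


# Table 10: K.D, K, D and twice the mean yield, in tons per acre.
effects = [-1.12, 3.80, 5.00, 15.62]
combinations = ["kd", "k", "d", "(1)"]
mean_yields = yields_from_effects(effects, factors=2)
print({c: round(y, 2) for c, y in zip(combinations, mean_yields)})
```

The four results are the mean yields of the combinations of k and d, averaged over n and no n, as in the last column of Table 10.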


4. Confounding. 

Confounding is a device whereby the necessity of including every combination
of the treatments of a factorial design in each block (or row and column in a
Latin square) is avoided. This enables the block size to be kept small even when
the number of treatment combinations is quite large.

In a confounded experiment the treatment combinations of each replication
are divided into two or more groups (each group being assigned a separate
block) in such a way that the contrasts between the different groups represent
high-order interactions, which as we have already seen are usually of less interest
than the main effects and interactions between two factors only. Thus in any
one replication the contrasts representing certain interactions are
confounded with the block differences, and in consequence in this replication
most* of the information on these interactions is sacrificed. If the
reduction of block size has been effective in reducing the experimental error the
precision of all the remaining comparisons is increased. Moreover, by
confounding different interactions in different replications, i.e. by partial confounding,
some information may be retained on all interactions. If the gain in
precision resulting from the confounding is sufficiently great even the partially
confounded interactions may be more accurately determined than would be the
case if the experiment were not confounded.

*A small amount arising from block comparisons remains, but is not in practice utilized.


4a. Example to illustrate confounding. 

A simple and useful example of confounding is provided by the arrangement
of a three-factor experiment in blocks of four plots. If the factors be represented
by a, b and c and we arrange the four treatment combinations

(1), ab, ac, bc,

in one block of each replication (randomizing within the block) and the other
four combinations

a, b, c, abc,

in the other block, the contrast between these two sets of blocks is equivalent
to the three-factor interaction A.B.C. Consequently all information on this
interaction is lost, except the small amount arising from inter-block comparisons.
It is easily seen, however, that block differences are eliminated from all the
other interactions and from all the main effects, since each of these comparisons
involves two plots with a plus and two plots with a minus sign from each block.
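
The statement can be checked mechanically. The sketch below assumes the usual sign convention (a factor contributes +1 when its small letter is present in the combination and -1 when absent, an interaction taking the product); the helper `sign` is a name introduced for the example.

```python
def sign(effect, combination):
    """Sign (+1 or -1) with which `combination` enters the contrast for `effect`."""
    result = 1
    for letter in effect.lower():
        result *= 1 if letter in combination else -1
    return result


block_1 = ["(1)", "ab", "ac", "bc"]
block_2 = ["a", "b", "c", "abc"]

# A.B.C has a constant sign within each block, so it is confounded with blocks.
print([sign("ABC", c) for c in block_1], [sign("ABC", c) for c in block_2])

# Every other effect has two plus and two minus signs in each block.
for effect in ["A", "B", "C", "AB", "AC", "BC"]:
    print(effect, sum(sign(effect, c) for c in block_1),
          sum(sign(effect, c) for c in block_2))
```

A zero sum within a block means that any constant added to all four plots of that block cancels out of the comparison.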

For reasons given in (9) it is best to arrange that neighbouring blocks 
are of unlike type, so that the blocks themselves form randomized pairs, each 
pair comprising a complete replication. 

In order to illustrate the modifications that are necessary in the statistical 
analysis we will reconstruct the analysis of the potato experiment already given, 
on the supposition that it was arranged in this way and gave yields identical
with those actually obtained. This will make clear the parallelism as well as
the differences between the two analyses.* Actual examples of confounded 
experiments are given later in the paper. 


4b. Statistical analysis. 

The partition of the degrees of freedom in the analysis of variance is given 
in Table 12. 

The formal analogy of this partition with that of split plot arrangements, 
discussed in Section 16a, should be noted. The blocks correspond to whole
plots, arranged in blocks of 2, and the plots to sub-plots. 

The appropriate error for testing N.K.D is "within block pairs." Not
only is this likely to be large, because it involves comparisons between whole
blocks, but it is also very ill-determined, being based on only 3 degrees of freedom.
Normally, therefore, the partition of the sum of squares "between blocks" is
not performed, only the three components, blocks, treatments and error being
calculated.

The steps of the whole calculation are as follows.

1. Calculate the sum of squares for blocks from the 8 block totals (given
in Table 11).


*[...] to a given experiment [...]
if it is found on analysis that the elimination of the sum of squares
[...] to the estimate of the experimental error, as in the potato experiment described [...]




Table 11. Block totals, N.K.D confounded.

  Ia     Ib    IIa    IIb   IIIa   IIIb    IVa    IVb
 1163   1133   1157   1134   1168   1201   1209   1166

Blocks (b) contain nkd. Note that the sum of the b's less the sum of the a's equals
-63, the N.K.D total of Table 7.

2. Calculate the sum of squares for the unconfounded treatment comparisons
by summing the squares of the relevant totals from Table 7. Check this (and
Table 7) by calculating the sum of squares for all treatments (ignoring
confounding) from the yields of the separate treatment combinations, and deducting
the N.K.D component.


Table 12. Analysis of variance, N.K.D confounded.

                                    D.F.   Sum of squares   Mean square
Between   { Between block pairs      3           774.1         258.0
blocks    { N.K.D                    1           124.0         124.0
          { Within block pairs       3           421.9         140.6
          { Total                    7          1320.0         188.6
Treatments*                          6        458593.9       76432.3
Error                               18          6865.8         381.4
Total                               31        466779.7

*Main effects and interactions between two factors (see Table 9).

3. Calculate the error sum of squares by subtraction. The remainder of 
the analysis of variance and the tests of significance proceed as before. 
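
The three steps can be followed numerically from the block totals of Table 11 and the effect totals of Table 7. In this sketch the total sum of squares is taken from Table 8, since the individual plot yields are not reproduced here; all variable names are assumptions of the example.

```python
block_totals = {"Ia": 1163, "Ib": 1133, "IIa": 1157, "IIb": 1134,
                "IIIa": 1168, "IIIb": 1201, "IVa": 1209, "IVb": 1166}
effect_totals = {"N": 333, "K": 2271, "N.K": 105, "D": 2987,
                 "N.D": 161, "K.D": -669, "N.K.D": -63}
TOTAL_SS = 466779.7                       # Table 8 (needs the plot yields to recompute)

grand_total = sum(block_totals.values())
correction = grand_total ** 2 / 32        # correction for the mean

# Step 1: blocks, from the 8 block totals of 4 plots each.
blocks_ss = sum(t ** 2 for t in block_totals.values()) / 4 - correction
pair_totals = [block_totals[r + "a"] + block_totals[r + "b"] for r in ("I", "II", "III", "IV")]
pairs_ss = sum(t ** 2 for t in pair_totals) / 8 - correction
nkd_ss = effect_totals["N.K.D"] ** 2 / 32
within_pairs_ss = blocks_ss - pairs_ss - nkd_ss

# Step 2: unconfounded treatment comparisons.
treatments_ss = sum(t ** 2 for name, t in effect_totals.items() if name != "N.K.D") / 32

# Step 3: error by subtraction, on 31 - 7 - 6 = 18 degrees of freedom.
error_ss = TOTAL_SS - blocks_ss - treatments_ss
error_mean_square = error_ss / 18

print(round(blocks_ss, 1), round(treatments_ss, 1), round(error_ss, 1), round(error_mean_square, 1))
```

The "within block pairs" figure computed without intermediate rounding differs from the printed 421.9 in the last decimal only.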


4c. Presentation of results. 

The presentation of the results requires slight modification, since any 
comparison involving N.K.D is affected by block differences. The best procedure 
is to divide the individual treatment combinations into two categories, according
as they fall into blocks (a) or (b), arranging the results as in Table 13. 


Table 13. Yields of treatment combinations, N.K.D completely confounded.

                              Blocks (a)                      Blocks (b)
                       (1)    nk     nd     kd         n      k      d     nkd
Unadjusted            2.84   8.06   9.35  11.20      2.85   7.49   8.59  12.10
Assuming N.K.D = 0    2.79   8.01   9.30  11.15      2.90   7.54   8.64  12.15



N.K.D will be omitted from the table of main effects and interactions.

If the table of individual treatment combinations is adjusted so that the
mean of the first four components is equal to the mean of the second four, by
the addition of one half of the apparent value of N.K.D, here 0.05, to
each of the second four and the deduction of the same amount from each of the
first four, this will eliminate block effects at the cost of assuming that N.K.D
is negligible. This procedure is not to be recommended as a general practice,
but is sometimes of value in the popular presentation of results. The main effects
and interactions between two factors, being unconfounded, can be presented by means
of the ordinary two-way tables.


4d. Example of partial confounding.

Instead of confounding the three-factor interaction A.B.C in every replication
of a three-factor experiment the two-factor interactions may also be confounded
in their turn. Thus the potato experiment might have been arranged in 8 blocks
of 4 plots each, the interaction N.K.D being confounded in the first pair, the
interaction N.K in the second pair, the interaction N.D in the third and the
interaction K.D in the fourth. The treatments would then have had to be
allotted to the pairs of blocks in the manner shown in Table 14.


Table 14. Arrangement of treatments and block totals, partial confounding.

Interaction
confounded              N.K.D             N.K              N.D              K.D
Block                 Ia     Ib       IIa    IIb      IIIa   IIIb      IVa    IVb

                      (1)    n        n      (1)      n      (1)       k      (1)
Treatments            nk     k        k      d        d      k         d      n
                      nd     d        nd     nk       nk     nd        nk     kd
                      kd     nkd      kd     nkd      kd     nkd       nd     nkd

Total                 1163   1133     1106   1185     1208   1161      1259   1116
Adjustment per plot   -2.4   +2.4     +8.8   -8.8     -14.5  +14.5     +4.0   -4.0


If this procedure had been adopted, full information on the interaction
N.K.D would have been obtained from the block pairs II, III and IV, but
no information would have been obtained from blocks I. Similarly, full
information on N.K would have been obtained from blocks I, III and IV, etc. Thus
three-quarters of the information available on the main effects would be available
on each of the interactions.


4e. Statistical analysis.

Certain modifications are required in the calculations of both the estimates
of the interactions and the analysis of variance.

The general principle to be followed in cases of partial confounding is to
estimate each partially confounded degree of freedom (or set of degrees of
freedom) only from those blocks in which it is not confounded. Sums of squares
are calculated from these estimates in the ordinary way, account being taken
of the fact that they are based on a reduced number of plots. The sum of
squares for blocks is computed from the block totals in the ordinary manner.
The calculation will here run as follows.

The block totals must first be calculated. These are given in Table 14.

The totals for the interactions must be recalculated, omitting the blocks
in which they are confounded. This can be done directly or by noting that
each is obtained from the corresponding total of Table 7 by removing the
difference of the two block totals in which it is confounded, with due attention
to the signs. In our example

[N.K]'   = +105 + 1106 - 1185 =  +26
[N.D]'   = +161 + 1208 - 1161 = +208
[K.D]'   = -669 + 1259 - 1116 = -526
[N.K.D]' =  -63 + 1163 - 1133 =  -33

- 33 













The analysis of variance will now contain a degree of freedom for each 
interaction, since each can be estimated. The sum of squares for the interactions 
will be obtained by summing the squares of the above four numbers and dividing 
by 24, since each is the sum of plus or minus 24 yields. The sum of squares 
for the main effects will be identical with that already obtained in the uncon- 
founded design. The sum of squares for blocks comes directly from the block 
totals. Finally the error sum of squares is obtained by subtraction. We thus 
obtain the analysis of variance shown in Table 15. 


Table 15. Analysis of variance, partial confounding.

                 D.F.   Sum of squares   Mean square
Blocks            7          4499.0         642.7
Main effects      3        443453.1      147817.7
Interactions      4         13404.4        3351.1
Error            17          5423.2         319.0
Total            31        466779.7
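
The calculation leading to Table 15 can be sketched as below. It relies on the observation that in Table 14 the (b) block of each pair contains nkd and therefore carries the plus sign of the interaction confounded in that pair. The total sum of squares is again taken as given, and all names are assumptions of the example.

```python
block_totals = {"Ia": 1163, "Ib": 1133, "IIa": 1106, "IIb": 1185,
                "IIIa": 1208, "IIIb": 1161, "IVa": 1259, "IVb": 1116}
main_totals = {"N": 333, "K": 2271, "D": 2987}
interaction_totals = {"N.K": 105, "N.D": 161, "K.D": -669, "N.K.D": -63}
confounded_in = {"N.K.D": "I", "N.K": "II", "N.D": "III", "K.D": "IV"}
TOTAL_SS = 466779.7                       # taken from Table 8

# Remove from each interaction total the contribution of the pair confounding it.
modified = {}
for name, total in interaction_totals.items():
    pair = confounded_in[name]
    modified[name] = total - (block_totals[pair + "b"] - block_totals[pair + "a"])

correction = sum(block_totals.values()) ** 2 / 32
blocks_ss = sum(t ** 2 for t in block_totals.values()) / 4 - correction
main_ss = sum(t ** 2 for t in main_totals.values()) / 32
interactions_ss = sum(t ** 2 for t in modified.values()) / 24   # 24 plots each
error_ss = TOTAL_SS - blocks_ss - main_ss - interactions_ss
error_mean_square = error_ss / 17

print(modified)
print(round(blocks_ss, 1), round(main_ss, 1), round(interactions_ss, 1), round(error_mean_square, 1))
```

The error sum of squares obtained without intermediate rounding differs from the printed 5423.2 by less than 0.1.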



In this analysis it is not possible conveniently to subdivide the degrees of 
freedom for blocks, as was done when N.K.D was totally confounded. 

The reader will notice that the estimates of error vary considerably in the
three analyses, Tables 8, 12 and 15. This, however, does not indicate that the
errors are different, since each is in fact an estimate of the same error. The
variation is due entirely to random sampling variation resulting from the omission
from the "error" of Table 8 of certain degrees of freedom, those "within
block pairs" in Table 12, and others less easily isolated in Table 15.

The estimates of the interactions follow directly from the modified totals
[N.K]', etc. Since each comprises 24 plots the conversion factor must be that
appropriate to the total of 12 plots, i.e. 60/(2240 × 12), giving, in tons per acre,
the values

N.K = +0.06, N.D = +0.46, K.D = -1.17, N.K.D = -0.07

values which, as should be the case, are not substantially different from those
already found.

The estimate of the standard error of each of the totals is clearly

√(24 × 319.0) = ±87.5

and this converted into tons per acre gives ±0.195. As before, the estimate
of the standard error of the main effects will be √(32 × 319.0), giving ±0.170
tons per acre.

Using the t test we find the 5% and 1% significance levels for the interactions to be
0.411 and 0.565. Thus K.D is significant at the 1% level, and N.D now attains
significance at the 5% level. This is an example of the fact that when tests are made on
part of the data only, effects which are insignificant when the whole of the data
is taken into account, may by chance attain significance. Such tests are, of
course, not valid, since they transgress the necessary condition that for a
chosen effect in any given experiment there can be only one appropriate test
of significance.


If we require the standard error of some function of the main effects and
interactions, as for example the response to potash in the presence of dung:

K + K.D = +3.80 - 1.17 = +2.63

the ordinary rule of taking the square root of the sum of the squares of the
standard errors of the two parts is applicable, since these parts are orthogonal
and therefore in effect independent. The required standard error is therefore
√(0.170² + 0.195²) = ±0.259.
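
The standard errors and significance levels of this subsection can be retraced as follows. The two t values for 17 degrees of freedom are not printed in the text and are assumed here from standard tables; the conversion factor has the form 60/(2240 × plots) used in Table 7. Results computed without intermediate rounding agree with the printed figures to within about 0.001.

```python
import math

ERROR_MEAN_SQUARE = 319.0                  # Table 15, 17 degrees of freedom
T_5_PERCENT, T_1_PERCENT = 2.110, 2.898    # assumed: tabulated t for 17 d.f.


def tons_per_acre_factor(plots_per_total):
    return 60 / (2240 * plots_per_total)


# Interaction totals rest on 24 plots, main-effect totals on all 32.
se_interaction = math.sqrt(24 * ERROR_MEAN_SQUARE) * tons_per_acre_factor(12)
se_main = math.sqrt(32 * ERROR_MEAN_SQUARE) * tons_per_acre_factor(16)

level_5 = T_5_PERCENT * se_interaction
level_1 = T_1_PERCENT * se_interaction

# Response to potash in the presence of dung, K + K.D: the parts are orthogonal,
# so their standard errors combine as the root of the sum of squares.
response = 3.80 - 1.17
se_response = math.hypot(se_main, se_interaction)

print(round(se_interaction, 3), round(se_main, 3))
print(round(level_5, 3), round(level_1, 3))
print(round(response, 2), round(se_response, 3))
```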

4f. Presentation of results. 

In partially confounded experiments the ordinary table of the yields of the 
separate treatment combinations is misleading, since the values are affected by
block differences, which may be very large. Since every interaction is determined,
however, a table of adjusted yields may be computed. The experimenter will 
be well advised, wherever possible, to avoid presenting a comprehensive table 
of this nature, since it is troublesome to compute, and is also troublesome to 
interpret, since the various comparisons are not all of the same accuracy. 
If, however, such a table is required, it can be calculated from the main effects 
and interactions by the method already given. Tables embracing certain selected 
factors only are likely to be of more interest and utility, and can be similarly 
computed. Thus in the present experiment we might reasonably exhibit a 
two-way table of the combinations of dung and potash, similar to Table 3. 

A useful check on the construction of tables of adjusted yields is provided 
by calculating the adjustments to the original yields necessary to eliminate block 
differences. Thus in our example the difference between blocks Ib and Ia
should, if there were no block effects, give the interaction N.K.D. Since [N.K.D]'
contains 24 plots and blocks Ia and Ib together contain 8 plots the difference
should be -33/3 = -11. Actually it is 1133 - 1163 = -30. The adjustment
per plot is therefore (30 - 11)/8 = 2.4, this being added to plots in Ib and
subtracted from plots in Ia. The other adjustments shown in Table 14 are
similarly computed. Thus the adjusted yield of combination nkd is

1807 + 2.4 - 8.8 + 14.5 - 4.0 = 1811.1

The reader will do well to satisfy himself that the use of these adjustments gives
a table of adjusted yields which is identical with that obtained by reconstruction
from the main effects and interactions.
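
The adjustments of Table 14 can be recomputed from the block totals and the modified interaction totals. The sketch below is illustrative only; the dictionary keys and variable names are assumptions of the example.

```python
block_totals = {"Ia": 1163, "Ib": 1133, "IIa": 1106, "IIb": 1185,
                "IIIa": 1208, "IIIb": 1161, "IVa": 1259, "IVb": 1116}
# Modified totals of the interaction confounded in each pair of blocks:
# [N.K.D]', [N.K]', [N.D]', [K.D]'.
modified_totals = {"I": -33, "II": 26, "III": 208, "IV": -526}

adjustment_b = {}       # per-plot adjustment added to block (b), subtracted from (a)
for pair, modified in modified_totals.items():
    expected = modified / 3                   # 8 of the 24 plots
    observed = block_totals[pair + "b"] - block_totals[pair + "a"]
    adjustment_b[pair] = (expected - observed) / 8

# nkd occurs once in the (b) block of every pair, so it receives all four adjustments.
adjusted_nkd = 1807 + sum(adjustment_b.values())

print({pair: round(a, 1) for pair, a in adjustment_b.items()})
print(round(adjusted_nkd, 1))
```

The same loop, applied to each combination with the sign appropriate to the block it falls in, gives the full table of adjusted yields.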


5. Systems of confounding for 2 × 2 × 2 × ... designs.

In the last section the confounding of a single degree of freedom corresponding
to the interaction of the three factors of a 2 × 2 × 2 design was explained.
We now consider the systems of confounding applicable to factorial designs
involving four or more factors, each at two levels, i.e. designs of the form 2ⁿ.

Any single main effect or interaction can be confounded, since it is a
contrast between one half of the treatment combinations and the combinations
in the other half, and it is therefore only necessary to assign these two groups


to the different blocks. If there are a large number of factors, however, a higher 
degree of confounding may be advisable. With 5 factors, for instance, there are 
32 treatment combinations. If these are divided into groups of 8 in any way 
then the 3 degrees of freedom corresponding to the comparisons between the 
four groups will be confounded. The problem is so to choose the groups that 
these 3 degrees of freedom correspond to high-order interactions. 

The possible solutions of this problem are provided by the following general 
rule : 

If three degrees of freedom are to be confounded in a 2ⁿ design any two,
corresponding to main effects or interactions, may be chosen at will. The 
“ generalized interaction ” between these two degrees of freedom will then also 
be confounded. (By the generalized interaction between A.B.C and A.D, for 
example, is meant B.C.D, A being struck out as it occurs in both of the first two 
expressions.) 
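
The rule for the generalized interaction amounts to keeping the letters that occur in exactly one of the two expressions. A minimal sketch, with a function name introduced for the example:

```python
def generalized_interaction(first, second):
    """Letters occurring in exactly one of the two effects, e.g. "A.B.C" and "A.D"."""
    letters = set(first.replace(".", "")) ^ set(second.replace(".", ""))
    return ".".join(sorted(letters))


print(generalized_interaction("A.B.C", "A.D"))
print(generalized_interaction("A.B.C.D.E", "B.C.D.E"))
```

The second call reproduces the case in which confounding a five-factor and a four-factor interaction also confounds a main effect.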

5a. Confounding with five factors.

In the case of the 2⁵ design the main effects and interactions are those
shown in Table 16.


Table 16. Main effects and interactions with five factors.

Main                        Interactions between
effects    two factors    three factors      four factors    five factors
A          A.B   B.D      A.B.C   A.D.E      A.B.C.D         A.B.C.D.E
B          A.C   B.E      A.B.D   B.C.D      A.B.C.E
C          A.D   C.D      A.B.E   B.C.E      A.B.D.E
D          A.E   C.E      A.C.D   B.D.E      A.C.D.E
E          B.C   D.E      A.C.E   C.D.E      B.C.D.E

If A.B.C.D.E is confounded, and also one of the interactions involving 
four factors, say B.C.D.E, then by the rule the main effect A is also confounded. 
The confounded set is thus 

A; B.C.D.E; A.B.C.D.E

The only other type of set containing A.B.C.D.E is

A.B; C.D.E; A.B.C.D.E

There is also the type of set

A.B.C; A.D.E; B.C.D.E

This is the most useful of all, for no main effect or interaction between two
factors is confounded. There are 15 such sets, for the factor corresponding
to A can be chosen in 5 ways, and the remaining four factors can be
divided into two pairs in 3 ways.

The actual partition of the 32 treatment combinations into four blocks of 8,
so that the chosen degrees of freedom are confounded, is effected by writing
down the signs of any two of the three confounded degrees of freedom in
the manner of Table 4, and allocating the four combinations + +, + -, - +
and - - so obtained to the four blocks.
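
The partition can be sketched as follows for the set A.B.C; A.D.E; B.C.D.E. Table 4 is not reproduced here, so the sign convention is an assumption of the example: a factor contributes +1 when its small letter is present and -1 when absent, and an interaction takes the product. The function names are hypothetical.

```python
from itertools import product

FACTORS = "abcde"


def sign(effect, combination):
    """Sign (+1 or -1) with which `combination` enters the contrast for `effect`."""
    result = 1
    for letter in effect.lower():
        result *= 1 if letter in combination else -1
    return result


def partition(first, second):
    """Group the 32 combinations by the pair of signs of two confounded effects."""
    blocks = {}
    for levels in product((0, 1), repeat=len(FACTORS)):
        combination = "".join(f for f, on in zip(FACTORS, levels) if on) or "(1)"
        key = (sign(first, combination), sign(second, combination))
        blocks.setdefault(key, []).append(combination)
    return blocks


blocks = partition("ABC", "ADE")
for key, members in sorted(blocks.items(), reverse=True):
    print(key, " ".join(members))
```

Within each block the sign of B.C.D.E is also constant, being the product of the other two, which is why the third member of the set is confounded automatically.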





By the device of partial confounding different sets may be confounded in
the different replications. With 5 replications a balanced group of sets such
as that given in Table 17 can be used, each of the interactions between three and
four factors being confounded once and once only. In this case 1/5 of the
information (relative to that on the unconfounded degrees of freedom) will
be sacrificed on these interactions.

Table 17. Balanced group of sets for 2⁵ design in blocks of 8 plots.

A.B.C; A.D.E; B.C.D.E
A.B.D; B.C.E; A.C.D.E
A.C.E; B.C.D; A.B.D.E
A.C.D; B.D.E; A.B.C.E
A.B.E; C.D.E; A.B.C.D
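
The balance of Table 17 can be verified by counting how often each interaction occurs, and checking that the third member of each set is the generalized interaction of the first two. The names used are assumptions of the example.

```python
from collections import Counter

table_17 = [("ABC", "ADE", "BCDE"),
            ("ABD", "BCE", "ACDE"),
            ("ACE", "BCD", "ABDE"),
            ("ACD", "BDE", "ABCE"),
            ("ABE", "CDE", "ABCD")]


def generalized_interaction(first, second):
    return "".join(sorted(set(first) ^ set(second)))


closed = all(generalized_interaction(a, b) == c for a, b, c in table_17)
counts = Counter(effect for group in table_17 for effect in group)

print(closed)
print(sorted(counts.items(), key=lambda item: (len(item[0]), item[0])))
```

Every one of the ten three-factor and five four-factor interactions appears exactly once, so each is confounded in one replication out of five.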

The rule given above is capable of extended application. Thus if blocks
of 4 plots are used in a 2⁵ design and the interaction B.D is chosen, in addition
to the first set of Table 17, the full set of 7 confounded interactions is

B.D; C.E; A.B.C; A.D.E; A.C.D; A.B.E; B.C.D.E

The eight combinations of signs arising from any three of these interactions
(the third not being the generalized interaction of the other two) will give the
partition into the eight blocks.
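
The full set of seven can be generated by taking the generalized interaction of every non-empty selection from three independent members. A brief sketch, with names introduced for the example:

```python
from itertools import combinations


def generalized_interaction(*effects):
    """Letters occurring an odd number of times among the given effects."""
    letters = set()
    for effect in effects:
        letters ^= set(effect)
    return "".join(sorted(letters))


generators = ["BD", "ABC", "ADE"]
confounded = {generalized_interaction(*chosen)
              for size in (1, 2, 3)
              for chosen in combinations(generators, size)}

print(sorted(confounded, key=lambda e: (len(e), e)))
```

Three generators give 2³ - 1 = 7 confounded degrees of freedom, matching the 7 degrees of freedom between eight blocks.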

Balanced groups of 5 sets of this type also exist, one of these groups being 
that given in Table 18. 


Table 18. Balanced group of sets for 2⁵ design in blocks of 4 plots.

AB, CD, ACE, ADE, BCE, BDE, ABCD
AC, DE, ABD, ABE, BCD, BCE, ACDE
AD, BE, ABC, ACE, BCD, CDE, ABDE
AE, BC, ABD, ACD, BDE, CDE, ABCE
BD, CE, ABC, ABE, ACD, ADE, BCDE


5b. Confounding with six factors.

The confounding of experiments including six factors follows similar lines.
With blocks of 16 treatments the most useful sets are those of the type

A.B.C.D; A.B.E.F; C.D.E.F

and with blocks of 8 treatments those of the type

A.C.E; B.D.E; B.C.F; A.D.F; A.B.C.D; A.B.E.F; C.D.E.F

With blocks of 4 treatments arrangements confounding 3 two-factor, 8 three-factor,
3 four-factor and the six-factor interaction are possible, and may be
obtained by "interacting" on the sets of Table 18 with E.F, B.F, C.F, D.F
and A.F respectively. A balanced group of sets will be thus attained. Balance
is also possible in 5 replications with blocks of 16 treatments, but with blocks
of 8 treatments, rather curiously, 10 replications are required for complete
balance: with 5 replications and blocks of 8 plots one three-factor interaction
must be confounded twice while another is not confounded at all.


5c. Confounding with four factors in blocks of 4 plots.

The best type of set for non-balanced arrangements is

A.B; A.C.D; B.C.D

but for complete balance this clearly demands 6 replications, and moreover 1/2 the
relative information on the three-factor interactions is lost. The alternative
group of sets given in Table 19 gives balance in 4 replications, and sacrifices
only 1/4 of the relative information on the three-factor interactions and 1/4 (instead
of 1/6) of the information on the two-factor interactions.

Table 19. 2⁴ design.

A.B; C.D; A.B.C.D
A.C; B.D; A.B.C.D
A.D; A.B.C; B.C.D
B.C; A.B.D; A.C.D

There is the further group of 5 sets (Table 20) which confounds every
degree of freedom once and therefore sacrifices 1/5 of the relative information
on every comparison. All comparisons are therefore of equal accuracy. This
design depends structurally on the complete set of orthogonal 4 × 4 Latin squares.

Table 20. Alternative 2⁴ design.

A; B; A.B
C; D; C.D
A.C; B.D; A.B.C.D
A.D; A.B.C; B.C.D
B.C; A.B.D; A.C.D
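
The fractions of information sacrificed under Tables 19 and 20 follow from counting the replications in which each effect is confounded. The sketch below assumes one replication per set, as in the text; the function name is hypothetical.

```python
from collections import Counter
from fractions import Fraction

table_19 = [("AB", "CD", "ABCD"), ("AC", "BD", "ABCD"),
            ("AD", "ABC", "BCD"), ("BC", "ABD", "ACD")]
table_20 = [("A", "B", "AB"), ("C", "D", "CD"), ("AC", "BD", "ABCD"),
            ("AD", "ABC", "BCD"), ("BC", "ABD", "ACD")]


def fraction_lost(sets):
    """Fraction of replications in which each effect is confounded."""
    counts = Counter(effect for group in sets for effect in group)
    return {effect: Fraction(count, len(sets)) for effect, count in counts.items()}


lost_19 = fraction_lost(table_19)
lost_20 = fraction_lost(table_20)
print({effect: str(f) for effect, f in lost_19.items()})
print({effect: str(f) for effect, f in lost_20.items()})
```

For Table 19 the count also shows that A.B.C.D is confounded in two of the four replications, which is the price paid for the reduced loss on the three-factor interactions.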

$d. General remarks. u- u j 

In agricultural field experiments in randomized blocks a very high degree 
of confounding is not usually advisable ; as a general rule the two-factor 
actions should be left unconfounded. We have, however, thought it worth 
to put on record the possible designs in blocks of 4 plots, both for the sake ot 
completeness and because they may be found to be of use in other prances 0 
biological experimentation where the block size is more definitely limited. 

Balanced arrangements are particularly useful when the experimental material
is such that a high degree of confounding is advisable, so that possibly important
interactions are likely to be involved. In agricultural experiments the number
of replications available is rarely great enough to attain balance in single
experiments including large numbers of factors (though balance may be
introduced in sets of experiments of similar design at different places). This
does not preclude partial confounding, which should always be used when
there is more than one replication and when a choice can be made between
interactions of the same order, unless one set can be pronounced
to be of no importance. Thus in the experiment on beans described in Section 7,
in which the factors were spacing, dung, nitrogen, phosphate and potash, the
three-factor interactions confounded were S.D.P and S.N.K. Had a further
replication been available the three-factor interactions D.P.K and ... might
advantageously have been confounded in it. It is instructive to identify these
sets with those given in general form in Table 17.





6. Estimation of error from high-order interactions. 


A further difficulty which limits the number of factors that can be included
in an experiment is the number of plots required. Thus with six factors 128
plots will be required for even two-fold replication.

If only a single replication is employed the experiment will not be capable
of furnishing an estimate of error by the ordinary procedure of comparing
replicates. There will, however, in large experiments be a number of interactions
between three or more factors which may in many cases be confidently predicted
to be small in comparison with the errors affecting them. If this is the case
they will in effect themselves be estimates of experimental error. Thus, for
example, in a 2^6 design no less than 22 of the 63 degrees of freedom for treatments
correspond to interactions between four or more factors. If the experiment
consists of a single replication and is arranged in blocks of 16 plots, three of these
will be confounded with block differences. The remaining 19 may then be used
as an estimate of experimental error.
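
The count of degrees of freedom quoted above can be checked with a few lines of Python. This is only an illustrative sketch: the function name is hypothetical, and the figure of three confounded degrees of freedom is taken from the text rather than derived.

```python
from math import comb

def interaction_df_by_order(n_factors):
    """Degrees of freedom of a 2^n design, grouped by the number of factors involved."""
    return {order: comb(n_factors, order) for order in range(1, n_factors + 1)}

df = interaction_df_by_order(6)
total_df = sum(df.values())                                      # all treatment comparisons
high_order_df = sum(v for k, v in df.items() if k >= 4)          # four or more factors
blocks = 64 // 16                                                # single replication, blocks of 16
error_df = high_order_df - (blocks - 1)                          # block comparisons are confounded
print(total_df, high_order_df, error_df)
```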

It should be noted that even if some of these high-order interactions do
happen, with some particular set of factors, to be appreciable, the experimenter
is still in a much better position than he would have been had the interacting
factor been omitted entirely from the design. For any particular interaction
(except those which are confounded) which later results may indicate to be of
importance can be isolated and examined. Moreover the criticism that the
inclusion of an interaction of some magnitude in the estimate of experimental
error will inflate that estimate does not carry much weight, since the true
experimental error (as estimated between replicates of the same treatment
combination) would not be applicable to the results of an experiment with the
interacting factor held constant, if it were intended that these results should
be treated as valid for all levels of the interacting factor.

This device of using only a single replication is particularly useful in 
agricultural field experiments. For it is well known that most of the effects 
which are being measured vary from year to year and place to place. A whole 
set of similar experiments, of moderate accuracy, conducted at different places
over a series of years, is thus of far more value for practical purposes than a 
single large experiment of equivalent accuracy. The use of only a single 
replication enables experiments comprising a reasonable number of factors to 
be carried out on ordinary non-experimental farms, and thus very considerably 
adds to the resources of the experimenter. 


7. An exploratory experiment on beans.

As an example of the points discussed in the last two sections we will
consider a 2^5 experiment on beans, conducted at Rothamsted in 1935.

The treatments consisted of all combinations of :

(S) Spacing of rows : 18 ins. apart (s1) or 24 ins. apart (s2).

(D) Dung : 10 tons per acre (d), or none.

(N) Nitrochalk : 0.4 cwt. N per acre (n), or none.
(P) Superphosphate : 0.6 cwt. P2O5 per acre (p), or none.
(K) Muriate of potash : 1.0 cwt. K2O per acre (k), or none.




The spacing was included to test the theory that the best effects of manures 
might be obtained with closely spaced rows. 

The plan is shown in Table 21. The yields are given in the first column 
in Table 22. 


Table 21. Plan of experiment on beans, and block totals. 

Block III (555.2)          Block IV (436.7)


Sittk 



i„npk 

Sitlp 

Sidn 

Sidnp 

i,n 

sjp 

Sipk 


j.A 

Sidk 

1 

s^dripk 

j^nk 

sjpk 

h ' 

Sidp 


sji 

Sitlk 


Sidnk 

Stdpk 

S^P 

Sidnpk 


Sidnp 

s^n 

S„pk 

Sid 

Sinpk 


Block I (412.3)          Block II (481.0)


Only a single replication was used, giving 32 plots in all, each of 1/40 acre,
these being arranged in four blocks of 8 plots each. Examination will show
that the following interactions are confounded :

Interaction     Contrast

S.D.P           I - II - III + IV

S.N.K           I + II - III - IV

D.N.P.K         I - II + III - IV
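
As an illustrative sketch, the three confounded contrasts can be evaluated directly from the block totals printed with Table 21. The variable and function names below are assumptions made for the example; the divisor 32 is the number of plots entering each contrast.

```python
# Block totals as printed with Table 21.
block_totals = {"I": 412.3, "II": 481.0, "III": 555.2, "IV": 436.7}

# Signs of blocks I, II, III, IV in each confounded contrast.
contrasts = {
    "S.D.P":   (+1, -1, -1, +1),
    "S.N.K":   (+1, +1, -1, -1),
    "D.N.P.K": (+1, -1, +1, -1),
}

def block_contrast(signs, totals=block_totals):
    """Signed sum of the four block totals."""
    return sum(s * totals[b] for s, b in zip(signs, ("I", "II", "III", "IV")))

values = {name: round(block_contrast(signs), 1) for name, signs in contrasts.items()}
# Each contrast is a total over 32 plots, so its sum of squares is value^2 / 32.
blocks_ss = sum(v * v for v in values.values()) / 32
print(values, round(blocks_ss, 2))
```

The three contrasts together account for the 3 degrees of freedom between blocks, so their sums of squares should add to the block sum of squares of the analysis of variance.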

7a. Analysis.

The calculation of the main effects and interactions is given in Table 22,
and the analysis of variance in Table 23.

The estimate of error is based on interactions between three or more factors.
The computations follow exactly the same lines as for the 2 x 2 x 2
experiment. The sum of squares for treatments is obtained by dividing the sum
of the squares of the totals of Table 22 corresponding to the main effects and
two-factor interactions by 32 (there being no need to write down the individual
squares), and the other two sums of squares are obtained similarly. A check
is given in Table 22 for each of the columns (1) to (5). A check of the whole
set of calculations is provided by the total sum of squares, which is calculated
direct from the yields of the separate treatment combinations.

A further useful check is obtained if the block totals are used in
calculating the total sum of squares (as can be done by the method
of Note 4 when, as often happens, the yields are tabulated by blocks). The
confounded interactions can then be calculated directly from these totals
and compared with the values already obtained.
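
The repeated sum-and-difference computation of Table 22 can be sketched in Python as follows. The function names are hypothetical. The yields are those of column (0) of Table 22, with s changing fastest; two entries that are hard to read in the scanned source are taken here as 36.3 and 51.3, which is an assumption based on their agreeing with the sums in column (1).

```python
def yates(values):
    """Repeated sum-and-difference procedure for a 2^n design.

    `values` must be in standard order (first factor changing fastest).
    Each step writes the sums of successive pairs, followed by the
    differences (second minus first) of the same pairs.
    """
    col = list(values)
    n_steps = len(col).bit_length() - 1
    for _ in range(n_steps):
        pairs = [(col[i], col[i + 1]) for i in range(0, len(col), 2)]
        col = [a + b for a, b in pairs] + [b - a for a, b in pairs]
    return col

def effect_names(letters="SDNPK"):
    """Names of the effects in the order produced by `yates`."""
    names = []
    for i in range(2 ** len(letters)):
        present = [letters[j] for j in range(len(letters)) if (i >> j) & 1]
        names.append(".".join(present) or "Total")
    return names

yields = [66.5, 36.2, 74.8, 54.7, 68.0, 23.3, 67.3, 70.5,
          56.7, 29.9, 76.7, 49.8, 36.3, 45.7, 60.8, 64.6,
          63.6, 39.3, 51.3, 73.3, 71.2, 60.5, 73.7, 92.5,
          49.6, 74.3, 63.6, 56.3, 48.0, 47.9, 77.0, 61.3]

totals = dict(zip(effect_names(), (round(t, 1) for t in yates(yields))))
print(totals["Total"], totals["S"], totals["D"], totals["S.K"])
```

With 32 plots of 1/40 acre and yields in lb., dividing an effect total by 16 and converting to cwt. per acre amounts to dividing by 44.8.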






Table 22. Calculation of main effects and interactions, beans experiment.

sdnpk  Yield (0)     (1)      (2)      (3)      (4)       (5)      Effect, cwt. per acre
00000     66.5     102.7    232.2    461.3    881.8    1885.2      21.04
10000     36.2     129.5    229.1    420.5   1003.4    -125.0*     -2.79  S
01000     74.8      91.3    213.1    525.4   -132.4    +251.2**    +5.61  D
11000     54.7     137.8    207.4    478.0     +7.4     +80.6      +1.80  S.D
00100     68.0      86.6    227.5    -91.9   +156.6     +52.0      +1.16  N
10100     23.3     126.5    297.9    -40.5    +94.6     +53.0      +1.18  S.N
01100     67.3      82.0    243.8     +5.8    +52.4     +82.4      +1.84  D.N
11100     70.5     125.4    234.2     +1.6    +28.2     +31.8‡
00010     56.7     102.9    -50.4    +73.3     -8.8     -88.2      -1.97  P
10010     29.9     124.6    -41.5    +83.3    +60.8     +47.2      +1.05  S.P
01010     76.7     131.7    -53.7    +56.2    +75.8      -7.8      -0.17  D.P
11010     49.8     166.2    +13.2    +38.4    -22.8    -187.2†
00110     36.3     123.9     -2.3    +58.1    +23.2     -82.6      -1.84  N.P
10110     45.7     119.9     +8.1     -5.7    +59.2     +14.4‡
01110     60.8      95.9    +17.4    +75.8    +32.2     +17.4‡
11110     64.6     138.3    -15.8    -47.6     -0.4     -10.0‡
00001     63.6     -30.3    +26.8     -3.1    -40.8    +121.6*     +2.71  K
10001     39.3     -20.1    +46.5     -5.7    -47.4    +139.8*     +3.12  S.K
01001     51.3     -44.7    +39.9    +70.4    +51.4     -62.0      -1.38  D.K
11001     73.3      +3.2    +43.4     -9.6     -4.2     -24.2‡     -0.54  S.D.K
00101     71.2     -26.8    +21.7     +8.9    +10.0     +69.6      +1.55  N.K
10101     60.5     -26.9    +34.5    +66.9    -17.8     -98.6†
01101     73.7      +9.4     -4.0    +10.4    -63.8     +36.0‡
11101     92.5      +3.8    +42.4    -33.2   -123.4     -32.6‡
00011     49.6     -24.3    +10.2    +19.7     -2.6      -6.6      -0.15  P.K
10011     74.3     +22.0    +47.9     +3.5    -80.0     -55.6‡
01011     63.6     -10.7     -0.1    +12.8    +58.0     -27.8‡
11011     56.3     +18.8     -5.6    +46.4    -43.6     -59.6‡
00111     48.0     +24.7    +46.3    +37.7    -16.2     -77.4‡
10111     47.9      -7.3    +29.5     -5.5    +33.6    -101.6‡
01111     77.0      -0.1    -32.0    -16.8    -43.2     +49.8†
11111     61.3     -15.7    -15.6    +16.4    +33.2     +76.4‡

Standard error                                          ±51.2      ±1.14

Totals (for checks) :
                       (0)       (1)       (2)       (3)       (4)       (5)
1st half  Odds (a)    507.1     817.0     827.6    1164.0    1080.8    2109.6
          Evens (b)   374.7    1068.2     932.6     928.0    1230.4     -95.2
2nd half  Odds (c)    498.0    -102.8     108.8     140.0     -47.2     103.2
          Evens (d)   505.4     -22.2     223.0      79.2    -249.6    -156.0

Checks for column (1) :  a1 + b1 = a0 + b0 + c0 + d0
                         c1 + d1 = -a0 + b0 - c0 + d0
and similarly for the other columns.

* Greater than 111 or 2.48 cwt.   ** Greater than 155 or 3.46 cwt.
‡ Used for estimate of error.   † Confounded with blocks.





Table 23. Analysis of variance, beans experiment.

                                        D.F.   Sum of squares   Mean square
Blocks                                    3        1476.43         492.14
Main effects and interactions between
  two factors                            15        4921.20         328.08
Remainder                                13        1066.64          82.05
Total                                    31        7464.27



Spacing, dung, and potash have produced significant effects, and in addition 
the interaction between spacing and potash is significant. It is to be noted that 
the dung and spacing show a similar (though smaller and non-significant) 
interaction. The table (Table 24) including these three factors is therefore of 
interest. It is not affected by the confounding, and may be constructed either 
from the main effects and interactions or by taking the mean yields of the
relevant sets of 4 plots. 


Table 24. Mean yields, cwt. per acre.

                    (1)      k       d       dk
18 in. spacing     20.3    20.7    25.0    23.7
24 in. spacing     12.1    19.8    21.4    25.3
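
The construction of Table 24 from the main effects and interactions can be sketched as below. The effect values are those given in cwt. per acre in Table 22; the function name and the coding of levels as 0 and 1 (with level 1 of S taken to be the 24 in. spacing) are assumptions of this sketch.

```python
from itertools import product

# General mean and effects in cwt. per acre, from Table 22.
mean = 21.04
effects = {"S": -2.79, "D": 5.61, "K": 2.71,
           "S.D": 1.80, "S.K": 3.12, "D.K": -1.38, "S.D.K": -0.54}

def cell_mean(levels):
    """Mean yield for one combination of levels, e.g. {'S': 0, 'D': 1, 'K': 0}.

    Each effect is a difference between two means, so it enters the
    cell mean with weight +1/2 or -1/2 according to its sign in that cell.
    """
    total = mean
    for name, value in effects.items():
        sign = 1
        for factor in name.split("."):
            sign *= 1 if levels[factor] == 1 else -1
        total += 0.5 * sign * value
    return total

table = {(s, d, k): round(cell_mean({"S": s, "D": d, "K": k}), 1)
         for s, d, k in product((0, 1), repeat=3)}
print(table)
```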


The experiment is not of high precision, being of only a single replication and having
a high standard error per plot (beans have at Rothamsted proved a very variable
crop), but in combination with other similar experiments it should provide useful
information, and in itself affords an illustration of the importance of putting
theories to experimental test, since the interaction between spacing and manures
turned out to be the opposite of what had been expected.

7b. Gain in precision due to confounding.

It is clear that the arrangement in blocks has increased the precision, since
the mean square for blocks is considerably greater than that for error. An
estimate of the amount of this gain can be made by replacing the treatment
mean square by the error mean square, and then calculating what the error mean
square would have been had there been no confounding. (This procedure assumes
that the confounded interactions are negligible, and is, of course, subject to
errors of estimation.)

The calculations are set out in full in Table 25.

Table 25. Gain in precision due to confounding.

                  D.F.   Sum of squares   Mean square
Blocks              3        1476.43         492.14
Within blocks      28        2297.40          82.05
Total              31        3773.83

The estimated error mean square for a block of 32 plots is 121.74, and the
efficiency of an unconfounded arrangement is therefore 82.05/121.74, or 67.4 per
cent. The reciprocal of this is 148.4 per cent, and the gain in information due
to confounding is thus 48.4 per cent.

It should be noted that if there is more than one replication, the whole of the 
sum of squares for blocks will not enter into the new estimate for error ; only those 
components which represent differences of blocks forming the same replication 
must be included. 
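
The arithmetic of this estimate is short enough to sketch in Python. The function and argument names are illustrative assumptions; the figures are those of Tables 23 and 25 for a single replication.

```python
def efficiency_without_blocks(blocks_ss, blocks_df, error_ms, within_df):
    """Efficiency of an unblocked layout relative to the blocked one.

    All within-block degrees of freedom are given the error mean square,
    so the within-block sum of squares is within_df * error_ms.
    """
    pooled_ms = (blocks_ss + within_df * error_ms) / (blocks_df + within_df)
    return error_ms / pooled_ms, pooled_ms

efficiency, pooled_ms = efficiency_without_blocks(1476.43, 3, 82.05, 28)
gain = 1 / efficiency - 1
print(round(pooled_ms, 2), round(100 * efficiency, 1), round(100 * gain, 1))
```

With more than one replication, only the block components within replications would be passed as `blocks_ss` and `blocks_df`, as noted above.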


8. Confounding in Latin square designs with factors at two levels. 

In a somewhat limited number of cases it is possible to adapt confounding
to Latin square designs. Thus, for example, a 2^4 system involving 16 treatment
combinations may be arranged in an 8 x 8 Latin square, there being four complete
replications. Any one degree of freedom for a main effect or interaction may be
confounded with rows (the rows being taken to represent blocks of 8 plots each),
and at the same time another degree of freedom for a main effect or interaction
may be confounded with columns. Alternatively partial confounding may be
adopted, each of the 4 degrees of freedom for three-factor interactions being
confounded in one of the four row-pairs, and the four-factor interaction being
completely confounded in the four column-pairs. Three-quarters of the relative
information will then be available on all three-factor interactions.

At the outset there is one point which should be emphasized. In order to 
obtain an unbiased estimate of error from a Latin square it is necessary to 
rearrange all rows in random order, and also all columns. Thus we are precluded 
from so arranging the experiment that the rows (or columns) forming each 
complete replication necessarily fall together in the field. This restriction is 
of importance in the types of design discussed in Section 16/ and 16^, in which 
main effects such as varieties are confounded. 

In spite of these limitations, such confounded Latin square designs as exist
are of considerable interest, in view of the markedly greater precision of Latin
squares as compared with randomized blocks in many types of agricultural field
trials. We will therefore give examples which will illustrate the possibilities
and limitations of this method of design. In this section we shall consider the
various types which are applicable to sets of factors at two levels only. These
must clearly utilize 4 x 4 and 8 x 8 squares. Further examples utilizing 6 x 6
and 9 x 9 squares will be given later.


8a. 2 x 2 x 2 design in two 4 x 4 Latin squares.

Since we may arrange a 2^3 design in blocks of 4 plots in such a way as to
confound any single degree of freedom, we may, in a single 4 x 4 square,
totally confound any two interaction degrees of freedom, one with rows and
one with columns, or alternatively we may partially confound two degrees with
rows and two with columns. As in any case, however, at least two
squares will be necessary to provide an adequate estimate of error, it is simpler




may be confounded in both squares, or N.P.K may be confounded with the
columns of both, P.K with the rows of one, and N.P and N.K partially with
the rows of the other. With three squares N.P.K may be confounded with
the columns of all three squares, and N.P, N.K and P.K with the rows of one
square each, thus obtaining 2/3 the relative information on all two-factor
interactions. Alternatively, if the two-factor interactions and the main effects are
of equal interest, these may each be confounded in one half of one square,
N.P.K being confounded in all squares, giving 5/6 the relative information on
all effects except N.P.K.

The necessary designs are easily constructed by writing down the sets of
treatment combinations that must fall together in the rows and the similar sets
that must fall together in the columns. Thus to confound P.K with the rows
and N.P.K with the columns the rows must consist of the two sets

(1)  n   pk  npk

p    k   np  nk

and the columns of the two sets

(1)   n

np    p

nk    k

pk    npk
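
The row and column sets written down above can be generated mechanically: a treatment combination falls in the same set as (1) when it shares an even number of letters with the confounded interaction. The following Python sketch illustrates this rule; the function names are hypothetical.

```python
from itertools import product

FACTORS = "npk"

def label(levels):
    """Treatment symbol for a tuple of 0/1 levels, e.g. (1, 0, 1) -> 'nk'."""
    name = "".join(f for f, on in zip(FACTORS, levels) if on)
    return name or "(1)"

def split_by_interaction(interaction):
    """Split the 8 treatment combinations into the two sets that differ
    in the sign of `interaction` (given as a string of factor letters)."""
    same, other = [], []
    for levels in product((0, 1), repeat=len(FACTORS)):
        shared = sum(on for f, on in zip(FACTORS, levels) if f in interaction)
        # An even number of shared letters puts the combination with (1).
        (same if shared % 2 == 0 else other).append(label(levels))
    return set(same), set(other)

row_sets = split_by_interaction("pk")
column_sets = split_by_interaction("npk")
print(row_sets, column_sets)
```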

This gives the following alternative squares (Table 26) with the first row and
the first column in an assigned order :

Table 26.

(1)   n     pk    npk          (1)   n     pk    npk
np    p     nk    k            np    k     nk    p
nk    k     np    p            nk    p     np    k
pk    npk   (1)   n            pk    npk   (1)   n

For each square of the experiment one of the two squares may be selected at
random, both the rows and columns being arranged in random order.

An alternative arrangement, which avoids confounding any two-factor
interaction, is also worth noting. If the four treatment combinations (1), np,
nk, pk, be arranged in a single 4 x 4 Latin square, and the other four combinations
n, p, k, npk, in a second square, then the three-factor interaction will
be identical with the comparison between the two squares. This arrangement
has the defect that any differences in response to one of the factors, n say, in
the two squares will give rise to an apparent interaction between the remaining
factors p and k. This defect may be overcome, however, though with some
probable loss of efficiency, by interlacing the two squares, one of each pair of
columns (if there are eight columns) being assigned at random to each square.

Thus after randomization we might arrive at the arrangement given in Table 27.


Table 27.

(1)   k     pk    p     npk   np    nk    n
nk    npk   np    n     k     pk    (1)   p
pk    n     (1)   k     p     nk    np    npk
np    p     nk    npk   n     (1)   pk    k


The analysis will be conducted just as it would be if the squares were not
interlaced, eliminating the rows as well as the columns of each square.





8b. Numerical example.

The above designs were superimposed on a uniformity trial on sugar beet
conducted by Immer (17).

Table 28 shows the actual arrangement derived by randomization from
Table 26 (the second square being selected in each case), and the yields of each
plot (... acre). P.K and N.P.K were confounded in both squares. The degree
of freedom confounded with rows (also assigned at random from the above two)
was N.P.K in the first square and P.K in the second.


Table 28. Plan and yields in lb.

k     n     p     npk         p     np    nk    k
542   587   583   576         549   562   576   569

nk    pk    np    (1)         n     pk    (1)   npk
629   615   634   594         637   623   643   629

np    (1)   nk    pk          npk   (1)   pk    n
562   596   624   627         639   628   645   651

p     npk   k     n           k     nk    np    p
604   638   609   634         615   586   605   618


The following estimates of the treatment effects (totals over 32 plots) were
obtained :

N = +109, P = -11, K = +55, N.P = -147, N.K = -5.
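
As a sketch of how these totals arise, the plan and yields of Table 28 may be entered row by row and each effect formed as a signed sum of the 32 yields. The names below are illustrative. Two yields are illegible in the scanned source; the values 627 and 618 used here are an assumption, chosen because they reproduce the effect totals quoted above.

```python
# Plan of Table 28, one field row per line (first square, then second square).
plan = """k n p npk p np nk k
nk pk np (1) n pk (1) npk
np (1) nk pk npk (1) pk n
p npk k n k nk np p""".split("\n")

# Yields in lb.; 627 and 618 are assumed values for two illegible entries.
yields = [[542, 587, 583, 576, 549, 562, 576, 569],
          [629, 615, 634, 594, 637, 623, 643, 629],
          [562, 596, 624, 627, 639, 628, 645, 651],
          [604, 638, 609, 634, 615, 586, 605, 618]]

def effect_total(interaction):
    """Signed total for an interaction given as letters, e.g. 'np'.

    A plot enters positively when an even number of the interaction's
    factors are absent from its treatment combination.
    """
    total = 0
    for row_plan, row_yields in zip(plan, yields):
        for treatment, y in zip(row_plan.split(), row_yields):
            absent = sum(1 for f in interaction if f not in treatment)
            total += y if absent % 2 == 0 else -y
    return total

effects = {name: effect_total(name.replace(".", "").lower())
           for name in ("N", "P", "K", "N.P", "N.K")}
print(effects)
```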

The analysis of variance is given in Table 29. 


Table 29. Analysis of variance, separate squares.

               D.F.   Sum of squares   Mean square
Squares          1          457.5          457.5
Rows             6        20488.4         3414.7
Columns          6         2797.9          466.3
Treatments       5         1145.7          229.1
Error           13         3460.6          266.2
Total           31        28350.1

The standard error of each of the above estimates is therefore ±92.3.
No one of the effects is significant.

The analysis of variance appropriate to the arrangement in interlaced
squares given in Table 27 is shown in Table 30.

Table 30. Analysis of variance, interlaced squares.

                      D.F.   Sum of squares   Mean square
Squares (= N.P.K)       1          442.5          442.5
Rows                    6        18540.4         3090.1
Columns                 6         2812.9          468.8
Treatments              6         2694.9          449.2
Error                  12         3859.4          321.6
Total                  31        28350.1





It will be noted that in this example rows have been very effective in 
eliminating soil heterogeneity. Table 31 shows the mean squares obtained with 
squares and blocks of various types : 


Table 31. Efficiency of various arrangements.

                                            D.F.   Mean square   Relative efficiency
4 x 4 Latin squares   separate               18       255.9*          100.0
                      interlaced             18       364.1*           70.3
Blocks of 4 plots     half-rows              24       308.5            82.9
                      columns                24      1045.6            24.5
                      2 x 2 squares          24       940.7            27.2
Blocks of 8 plots     rows                   28       407.6            62.8
                      pairs of half-rows     28       867.7            29.5
                      pairs of columns       28       949.1            27.0
Blocks of 16 plots    pairs of rows          30       829.5            30.8
                      squares                30       929.8            27.5

* Treatments + error of Tables 29 and 30.


The major part of the soil heterogeneity lies in differences between rows,
and consequently blocks along the rows are reasonably efficient. They are,
however, a form of block which would not in practice be used unless prior
information on the fertility differences of the field was available. The alternative
forms of block, whether of 4 or 8 plots, have all efficiencies of less than
30 per cent. The arrangement in interlaced squares is somewhat less efficient
than the arrangement in separate squares, but has served to eliminate the greater
part of the variation due to rows.

It is not claimed that this example is typical of the average gain in efficiency
that may be expected from the use of Latin squares instead of randomized blocks.
It is, however, an excellent illustration of the power of Latin squares to deal
with the types of soil heterogeneity met with in agriculture. In general
it should be noted that if we have any type of experimental material which can
be classified in two ways, with both of which variation is associated, the
elimination of both sources of variation simultaneously gives a greater
decrease in error variance over the average of that produced by elimination
of either source separately. Measured in terms of information (which
is equal to the reciprocal of the error variance per plot) the additional gain by
the simultaneous elimination of both sources is even greater.

It is also to be remarked that if the variation associated with one
classification is large, while that associated with a second is small, the
use of the second classification for blocks will always ... than
if the experiment were arranged wholly at random. Thus in the present example
the elimination of columns after eliminating rows increased the information
per plot from 82.9 to 100, whereas the elimination of columns without eliminating
rows has decreased it from 27.5 to 24.5.





8c. Arrangements for five and six factors in an 8 x 8 square.

The arrangement of five and six factors in 4 x 4 squares is also possible
if the confounding of some of the two-factor interactions is permitted, but the
use of an 8 x 8 square appears more suitable, since all two-factor interactions
can then be kept free from confounding.

In the case of five factors, groups or sets may be chosen from those shown
in Table 17. If only a single square is available, partial confounding within
the square may suitably be resorted to, four out of the five sets being confounded,
two with rows and two with columns. In the square shown in Table 32 the
first group of Table 17 is confounded in rows 1-4, the second in rows 5-8,
the third in columns 1-4 and the fourth in columns 5-8, the fifth group
being unconfounded. In this table the first of the pair of numbers gives the
combination of the a, b, and c treatments, according to the scheme :

1 = (1), 2 = a, 3 = b, 4 = ab, 5 = c, 6 = ac, 7 = bc, 8 = abc,
and the second of the pair of numbers gives the d and e treatments, according
to the scheme :

1 = (1), 2 = d, 3 = e, 4 = de.

Thus 72 = bcd.

Table 32. 8 x 8 quasi-Latin square for five factors.

11   43   71   63   42   62   74   14
73   61   13   41   72   12   44   64
54   82   34   22   83   23   31   51
32   24   52   84   33   53   81   21
81   53   64   72   11   34   22   43
62   74   83   51   24   41   13   32
44   12   21   33   54   71   63   82
23   31   42   14   61   84   52   73


The analysis follows the ordinary lines, the partially confounded interactions
being computed from the rows or columns in which they are unconfounded.
There are thus 18 degrees of freedom for error. As before rows and columns
must be completely randomized amongst themselves.

In the case of six factors the system of confounding will be of the type :
Rows : A.C.E ; A.D.F ; B.D.E ; B.C.F ; A.B.C.D ; A.B.E.F ; C.D.E.F
Columns : A.B.F ; A.D.E ; B.C.D ; C.E.F ; A.B.C.E ; A.C.D.F ; B.D.E.F
The square shown in Table 33 confounds these interactions. The second number
now indicates one of the eight combinations of d, e and f.
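
A property of the two sets above that is easily verified by machine is that each is closed: the product of any two interactions in a set, formed by cancelling the letters they have in common, is again a member of the same set. The following Python sketch checks this; the function names are hypothetical.

```python
from itertools import combinations

def product_of(a, b):
    """Generalized interaction: letters common to both cancel out."""
    return "".join(sorted(set(a) ^ set(b)))

def is_closed(interactions):
    """True if the product of every pair of members is itself a member."""
    members = {"".join(sorted(i)) for i in interactions}
    return all(product_of(a, b) in members for a, b in combinations(members, 2))

rows = ["ACE", "ADF", "BDE", "BCF", "ABCD", "ABEF", "CDEF"]
columns = ["ABF", "ADE", "BCD", "CEF", "ABCE", "ACDF", "BDEF"]
print(is_closed(rows), is_closed(columns), set(rows) & set(columns))
```

The empty intersection confirms that no interaction is confounded with both rows and columns.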


Table 33. 8 x 8 quasi-Latin square for six factors.

11   24   36   47   58   65   73   82
27   16   44   31   62   53   85   78
38   45   13   22   71   84   56   67
42   33   25   18   87   76   64   51
54   61   77   86   15   28   32   43
66   57   81   74   23   12   48   35
75   88   52   63   34   41   17   26
83   72   68   55   46   37   21   14


If 128 plots are available, a second square confounding a completely different
set of three-factor interactions may be obtained from the above square by
changing a to c, c to f, f to e, and e to a. Two four-factor interactions will be
confounded in both squares.
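
The effect of this change of letters can be checked with a short sketch, assuming the confounded interactions listed for the rows and columns of the first square. The names used are illustrative only.

```python
# Cyclic change of letters: a -> c, c -> f, f -> e, e -> a.
RELABEL = str.maketrans({"a": "c", "c": "f", "f": "e", "e": "a"})

def canonical(interaction):
    """Letters of an interaction in alphabetical order."""
    return "".join(sorted(interaction))

first = {"ace", "adf", "bde", "bcf", "abcd", "abef", "cdef",   # rows
         "abf", "ade", "bcd", "cef", "abce", "acdf", "bdef"}   # columns
second = {canonical(i.translate(RELABEL)) for i in first}

shared = first & second
shared_three_factor = {i for i in shared if len(i) == 3}
print(sorted(shared), sorted(shared_three_factor))
```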

With only a single replication error will have to be estimated from high- 
order interactions. If all 12 unconfounded three-factor interactions are retained 
there will remain 16 degrees of freedom for error. 

The actual factor which each letter is taken to represent in these designs 
must, of course, depend on the interest which attaches to the various interactions, 
the aim being to confound (as far as is possible) only those interactions which
are likely to be of little importance. 

The rows and columns of each square must be rearranged in random order 
for every experiment. 

9. Factors at more than two levels. 

In the preceding sections we have described factorial designs in which every 
factor is at two levels only. Many cases arise in practice, however, in which 
more than two levels of some or all of the factors are required. In all cases 
in which it is necessary to determine the optimal level of a factor, for instance, 
at least three levels are essential, and in factorial experiments in which varieties 
are included as one of the factors the use of three varieties rather than two is 
usually advisable. 

When some or all of the factors are at more than two levels, part of the 
simplicity that attaches to factorial designs with factors at two levels only is lost. 
To the main effects of a factor at four levels, for instance, there will correspond 
3 degrees of freedom, and similarly for all interactions involving this factor. 
The calculations required for the analysis of variance are consequently more 
complicated. Moreover the possibilities of confounding are much more 
restricted, and the designs which exist are less elegant and more troublesome 
statistically, particularly with factors at different numbers of levels. 

In this section we will consider the modifications that are necessary in the 
statistical analysis when there is no confounding. In later sections the simpler 
types of confounding will be described. 

9a. Two factors.

In a varietal and manuring experiment on oats (Rothamsted, 1931) four
levels of nitrogen (0, 0.2, 0.4 and 0.6 cwt. per acre) were applied to each of
three varieties, Victory, Golden Rain II and Marvellous. There were six
replicates on ... acre plots. The total yields of each of the twelve treatment
combinations are given in Table 34.

Table 34. Varietal and manurial experiment on oats.

Treatment totals (1/4 lb.)

                   n0      n1      n2      n3     Total
Victory            429     538     665     711     2343
Golden Rain II     480     591     688     749     2508
Marvellous         520     651     703     761     2635
Total             1429    1780    2056    2221     7486





Since there are twelve treatment combinations there must be 11 degrees
of freedom for treatments. These can, as before, be divided into main effects
and interactions.

There will be 3 degrees of freedom for the main effects of N and 2 degrees
of freedom for the varietal differences. This leaves 6 degrees of freedom for
interactions. (Note that 6 = 3 x 2.)

If (as is natural here) the main effects are defined as the average response
to one factor at all levels of the other they will be derivable from the two sets
of marginal totals of Table 34. The sums of squares corresponding to each
set can be calculated in the ordinary manner from the sum of the squares of the
deviations of these marginal totals, dividing by the number of plots in each.
Thus the sum of squares for N is given by

1/18 [1429^2 + 1780^2 + 2056^2 + 2221^2 - 18 x 778336.06]

(Note the method, the most suitable for a calculating machine, of applying the
correction for the mean. This correction, 7486^2/72, should be calculated first
and written down, as it is wanted repeatedly.)

The sum of squares for interactions cannot be conveniently calculated
directly, and must therefore be obtained by subtraction from the total sum of
squares for treatments. The full analysis is as follows (Table 35) :


Table 35. Partition of the treatment sum of squares in the varietal and
manurial trial.

                        D.F.   Sum of squares   Mean square
Correction for mean               778336.06
Nitrogen                  3        20020.50        6673.50
Varieties                 2         1786.36         893.18
Interactions              6          321.75          53.63
All treatments           11        22128.61
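
The partition of Table 35 can be reproduced from the treatment totals of Table 34 as in the sketch below. The variable names are assumptions of the example, and the n3 column, which is illegible in the scanned source, is taken here as the values implied by the printed row totals.

```python
# Treatment totals of Table 34 (six replicates); columns are n0, n1, n2, n3.
totals = {
    "Victory":        [429, 538, 665, 711],
    "Golden Rain II": [480, 591, 688, 749],
    "Marvellous":     [520, 651, 703, 761],
}
replicates = 6
n_levels, n_varieties = 4, len(totals)
plots = replicates * n_levels * n_varieties            # 72 plots in all

grand = sum(sum(row) for row in totals.values())
correction = grand ** 2 / plots                        # correction for the mean

nitrogen_totals = [sum(col) for col in zip(*totals.values())]
variety_totals = [sum(row) for row in totals.values()]

ss_nitrogen = sum(t * t for t in nitrogen_totals) / (replicates * n_varieties) - correction
ss_varieties = sum(t * t for t in variety_totals) / (replicates * n_levels) - correction
ss_treatments = sum(t * t for row in totals.values() for t in row) / replicates - correction
ss_interactions = ss_treatments - ss_nitrogen - ss_varieties   # by subtraction

print(round(correction, 2), round(ss_nitrogen, 2), round(ss_varieties, 2),
      round(ss_interactions, 2), round(ss_treatments, 2))
```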


There is no automatic check on this table, and all the computations must
therefore be carefully checked.

It will be noted that the above computations are exactly analogous to those
of the ordinary analysis of variance of a randomized block experiment. Nitrogen
and varieties correspond to blocks and treatments, interactions to error, and all
sums of squares are divided by 6, since each value of Table 34 is the total of 6 plots.

We will discuss the layout and conclusions of this experiment in Section 16b.
9b. Three or more factors.

The extension of the analysis to three or more factors follows on the
same lines. Thus with a at 3 levels, b at 4 levels and c at 4 levels, the
partition of the 47 degrees of freedom is as follows :

A  2        A.B  6        A.B.C  18
B  3        A.C  6
C  3        B.C  9




In order to calculate the sums of squares three two-way tables will be 
required, one between each of the three pairs of factors, the sums being taken 
over all the remaining factors. Each set of marginal totals occurs twice, thus 
providing useful checks on the construction of the table. These three tables 
will give the sums of squares for the main effects and interactions between two 
factors. The sum of squares for the interaction between all three factors can 
then be obtained by subtraction. 

9c. Simplification when one of the factors is at two levels only.

If one of the factors is at two levels only the interactions of this factor with 
the others can be calculated directly by using the differences of the yields at 
the two levels of this factor for all combinations of the other factors, and analysing 
these in exactly the same manner as the totals of the yields at the two levels. 
In the case of two factors only the calculations can be arranged as in Table 36, 
which gives the total yields in pounds of the five replicates (jV acre plots) of an 
experiment on different proportions of oats and vetches in a forage mixture, both 
with and without nitrogen (Rothamsted, 1932). 


Table 36. Experiment on seed mixtures and nitrogen.

                                    Seeding rates (lb. per acre).
                    200 oats     150 oats     100 oats      50 oats      No oats
                    No vetches   50 vetches   100 vetches   150 vetches  200 vetches    Total
Without nitrogen      1405         1661         1788          1684         1342          7880
With nitrogen         1788         1979         2000          1792         1468          9027
Sum                   3193         3640         3788          3476         2810         16907
Difference            +383         +318         +212          +108         +126         +1147


The sum of squares for N is given by 1147^2/50. The sum of squares for
interactions is given by

1/10 [383^2 + 318^2 + . . . - 229.4 x 1147]

Table 37 shows the full analysis of variance.


Table 37. Analysis of variance of experiment on seed rates.

                               D.F.   Sum of squares   Mean square
Correction for mean                      5716933.0
Treatments :  Seedings           4         60313.9        15078.5
              Nitrogen           1         26312.2        26312.2
              Interactions       4          5717.5         1429.4
              Total              9         92343.6
Blocks                           4         59601.9        14900.5
Error                           36         28384.5          788.5
Total                           49        180330.0
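
The treatment part of Table 37 follows from the sums and differences of Table 36, as the following sketch shows. The variable names are illustrative. The block and error lines need the individual plot yields, which are not given here, so they are not reproduced.

```python
# Totals of five replicates for the five seed mixtures (Table 36).
without_n = [1405, 1661, 1788, 1684, 1342]
with_n = [1788, 1979, 2000, 1792, 1468]
replicates = 5

sums = [a + b for a, b in zip(without_n, with_n)]
diffs = [b - a for a, b in zip(without_n, with_n)]
plots = 2 * len(sums) * replicates                       # 50 plots in all
per_mixture = 2 * replicates                             # plots behind each sum or difference

correction = sum(sums) ** 2 / plots
ss_seedings = sum(s * s for s in sums) / per_mixture - correction
ss_nitrogen = sum(diffs) ** 2 / plots
# The correction for the mean difference equals the sum of squares for N.
ss_interactions = sum(d * d for d in diffs) / per_mixture - ss_nitrogen
ss_treatments = ss_seedings + ss_nitrogen + ss_interactions

print(round(correction, 1), round(ss_seedings, 1), round(ss_nitrogen, 1),
      round(ss_interactions, 1), round(ss_treatments, 1))
```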






Provided that the correction for the mean is computed twice, and that in
calculating the interaction sum of squares the correction for the mean difference
(equal to the sum of squares for N) is recomputed as shown, all the treatment
sums of squares and the sums and differences of Table 36 are checked by
computing the total sum of squares from the 10 values in the body of the table.


gd. Procedure token two or more factors are at two levels only. 

The main effects and interactions involving the factors at two levels only
may be computed by the method of Section 3 for each combination of the other
factors. The analysis of these and their totals over the different levels of the
other factors will give all the sums of squares required.

An example will make the procedure clear. The first three columns of
Table 38 show the total yields of the treatment combinations of a 3 x 2 x 2
experiment on potatoes (Rothamsted, 1933). All combinations of


    n0  = no artificial nitrogen        (1) = no poultry manure        (1) = no super
    n1  = sulphate of ammonia           m   = poultry manure           p   = super
    n2  = ammonia bicarbonate


were applied. There were three replicates on plots of 55 acre. The arrange- 
ment was confounded in blocks of 6 plots, and is discussed in Section 13c. 


Table 38. Computation of main effects and interactions of a 3 x 2 x 2 experiment.

             Yields (lb.)                                                       Effects
        n0    n1    n2   Total  |    n0     n1     n2   Total  |    n0     n1     n2   Total

(1)    411   479   451   1341   |   855   1057    968    2880  |  2073   2361   2115    6549   Sum
p      444   578   517   1539   |  1218   1304   1147    3669  |  +129    +85   +121    +335   P
m      561   659   546   1766   |   +33    +99    +66    +198  |  +363   +247   +179    +789   M
mp     657   645   601   1903   |   +96    -14    +55    +137  |   +63   -113    -11     -61   P.M


The sums and differences of pairs of values in the first four columns are
shown in the next four columns, and the sums and differences of these latter
in the last four columns, which give the totals of the main effects and interaction
of p and m for n0, n1 and n2, and the total of all n. The total column forms
a check on the operation at each stage.
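The sum-and-difference process can be illustrated with a short Python sketch applied to the yields of Table 38. The function name `sum_and_difference` is hypothetical, and the code is only one possible arrangement of the arithmetic described above.

```python
# Yields (lb.) from Table 38: rows (1), p, m, mp; columns n0, n1, n2.
yields = {
    "(1)": [411, 479, 451],
    "p": [444, 578, 517],
    "m": [561, 659, 546],
    "mp": [657, 645, 601],
}
order = ["(1)", "p", "m", "mp"]


def sum_and_difference(column):
    """One stage: sums of successive pairs, followed by their differences."""
    pairs = [(column[i], column[i + 1]) for i in range(0, len(column), 2)]
    return [a + b for a, b in pairs] + [b - a for a, b in pairs]


effects = {}
for j, level in enumerate(["n0", "n1", "n2"]):
    column = [yields[t][j] for t in order]
    # Two stages give, in order: Sum, P, M, P.M for this level of n.
    effects[level] = sum_and_difference(sum_and_difference(column))

# Totals over all levels of n, which serve as the check column.
totals = [sum(effects[level][i] for level in effects) for i in range(4)]
print(effects)
print(totals)
```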

The treatment sums of squares can now be calculated immediately. The
correction for the mean is given by $6549^2/36$, the sum of squares for N by

$\tfrac{1}{12}\,[2073^2 + 2361^2 + 2115^2 - 6549 \times 2183]$,

the sum of squares for P by $335^2/36$, the sum of squares for P.N by

$\tfrac{1}{12}\,[129^2 + 85^2 + 121^2 - 335 \times 111.66667]$

and so on.
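As a hedged illustration, the sums of squares can be obtained from the effect totals of Table 38 as follows. The helper names are hypothetical; the divisors 36 and 12 are the numbers of plots in the whole experiment and at each level of n respectively.

```python
# Effect totals for n0, n1, n2 taken from the last columns of Table 38.
effect_totals = {
    "Sum": [2073, 2361, 2115],
    "P": [129, 85, 121],
    "M": [363, 247, 179],
    "P.M": [63, -113, -11],
}
PLOTS = 36            # 12 treatment combinations x 3 replicates
PLOTS_PER_LEVEL = 12  # plots at each level of n


def overall_ss(values):
    """Sum of squares for the effect totalled over all levels of n (1 d.f.)."""
    return sum(values) ** 2 / PLOTS


def interaction_with_n_ss(values):
    """Sum of squares for the variation of the effect between levels of n (2 d.f.)."""
    return sum(v * v for v in values) / PLOTS_PER_LEVEL - overall_ss(values)


ss = {
    "N": interaction_with_n_ss(effect_totals["Sum"]),
    "P": overall_ss(effect_totals["P"]),
    "P.N": interaction_with_n_ss(effect_totals["P"]),
    "M": overall_ss(effect_totals["M"]),
    "M.N": interaction_with_n_ss(effect_totals["M"]),
    "P.M": overall_ss(effect_totals["P.M"]),
    "P.M.N": interaction_with_n_ss(effect_totals["P.M"]),
}
total_treatment_ss = sum(ss.values())
for name, value in ss.items():
    print(f"{name:6s} {value:10.1f}")
```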


These sums of squares are set out in Table 39. The whole calculation is
checked by calculating the treatment sum of squares from the individual treatment
combinations.

In this particular experiment the degrees of freedom for M and P.M.N
were partially confounded, so the sums of squares for these degrees of freedom
in Table 39 are not those that appear in the final analysis described later.




Table 39. Partition of treatment sum of squares.

                          D.F.    Sum of squares    Mean square
Correction for mean         1        1191372.2

N                           2           4034.0         2017.0
P                           1           3117.4         3117.4
P.N                         2             91.5           45.8
M                           1          17292.2        17292.2
M.N                         2           1442.7          721.4
P.M                         1            103.4          103.4
P.M.N                       2           1301.5          650.8

Total                      11          27382.7



If in the summary of the results two-way tables giving the yields of pairs
of factors are required, that for the pairs of factors p and m can be derived 
immediately by conversion of the first total column of Table 38, while that 
for n and m can be derived by the conversion of the first two lines of the second 
set of four columns and the first line of the last set. Only that for the pair 
of factors n and p will require any fresh summations. 

9e. Two factors at three levels : formal subdivision of interactions in a 3 x 3 table.

If the yield totals of the 9 treatment combinations are denoted by the 
numbers 1-9 according to the scheme of Table 40 : 


Table 40. Yield totals.

         b0    b1    b2
a0        1     4     7
a1        2     5     8
a2        3     6     9


what may be called the two sets of diagonal totals of this table may be defined as 


    [I1] = 1 + 5 + 9        [J1] = 1 + 6 + 8
    [I2] = 2 + 6 + 7        [J2] = 2 + 4 + 9
    [I3] = 3 + 4 + 8        [J3] = 3 + 5 + 7


The four degrees of freedom for the interactions of a 3 x 3 table may be
divided into two orthogonal pairs of degrees of freedom, for which the sums
of squares are given by the appropriate fraction of the sums of the squares of
the deviations of [I] and of [J] respectively, just as the sums of squares for the
main effects are derived from [A] and [B]. Equally a table of the mean yields
of the treatment combinations can be constructed from a knowledge of [A],
[B], [I], and [J], or the corresponding means. Thus, for example, with . . .
replications,

$a_1 b_2 = \mathrm{dev}\,A_1 + \mathrm{dev}\,B_2 + \mathrm{dev}\,I_3 + \mathrm{dev}\,J_1 + \mathrm{mean}$

This formal subdivision provides a useful method of computing the
interactions of a single 3 x 3 table. The method is distinctly shorter than the
ordinary method of subtraction, since the whole computation then becomes
self-checking. The analogous subdivision of the three-factor interactions in a
3 x 3 x 3 design will be described when dealing with the confounding of this
design.

The conventional numbering of the 9 treatment combinations of a pair
of three-level factors given in Table 40 will be extensively used in subsequent
pages. It should therefore be memorized. Note that the first factor is always
written downwards.

9f. Example.

In an experiment on the manuring of meadow hay (Bakewell, 1935) the 
treatments (nothing, compost, and equivalent artificials) followed a two-year 
cycle, making 9 treatment combinations in all. The 1935 yields are given in
Table 41. The marginal and diagonal totals are also shown in this table. 

Table 41. Yields of hay in 1935 in lb. (totals of 4 plots of ify acre). 

1933 and 1935              1932 and 1934 treatments              Diagonal totals
treatments          Nil    Artificials    Compost     Total         [I]      [J]

Nil                65.2       71.0          85.2      221.4        274.4    262.2
Artificials       104.2      101.0         112.2      317.4        274.2    283.4
Compost            94.5       84.8         108.2      287.5        277.7    280.7

Total             263.9      256.8         305.6      826.3


The partition of the treatment sum of squares is shown in Table 42. 


Table 42. Partition of treatment sum of squares.

                                D.F.    Sum of squares    Mean square
1932 and 1934 treatments          2         115.85           57.92
1933 and 1935 treatments          2         402.20          201.10
Interactions                      4          22.83            5.71

All treatments                    8         540.89

Since the subdivision of the interaction degrees of freedom is formal, and does
not correspond to any expected treatment effects, there is no point in calculating
the two components of the sum of squares separately. The squares of all six
diagonal totals are summed and 24 (= 2 x 12) times the correction for the mean
is deducted, before dividing by 12. The fact that the total of the three sums
of squares equals the total sum of squares for treatments checks the whole
computation. If the interaction sum of squares were not computed directly
every item would have to be checked.
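The direct computation from the diagonal totals can be sketched as below, using the yields of Table 41. The function `diagonal_totals` is a hypothetical helper; it follows the numbering of Table 40, with the first factor (here the 1933 and 1935 treatments) written downwards.

```python
# Table 41: rows = 1933 and 1935 treatments, columns = 1932 and 1934 treatments
# (nil, artificials, compost in each case); each entry is a total of 4 plots.
table = [
    [65.2, 71.0, 85.2],
    [104.2, 101.0, 112.2],
    [94.5, 84.8, 108.2],
]
PLOTS_PER_CELL = 4
PLOTS_PER_TOTAL = 3 * PLOTS_PER_CELL      # 12 plots in each marginal or diagonal total


def diagonal_totals(t):
    """Return ([I1, I2, I3], [J1, J2, J3]) in the numbering of Table 40."""
    i_tot = [sum(t[(j + k) % 3][j] for j in range(3)) for k in range(3)]
    j_tot = [sum(t[(k - j) % 3][j] for j in range(3)) for k in range(3)]
    return i_tot, j_tot


grand = sum(map(sum, table))
correction = grand ** 2 / (9 * PLOTS_PER_CELL)
row_totals = [sum(row) for row in table]
col_totals = [sum(col) for col in zip(*table)]
i_tot, j_tot = diagonal_totals(table)

ss_rows = sum(x * x for x in row_totals) / PLOTS_PER_TOTAL - correction
ss_cols = sum(x * x for x in col_totals) / PLOTS_PER_TOTAL - correction
# Squares of all six diagonal totals, less twice the correction for the mean.
ss_inter = sum(x * x for x in i_tot + j_tot) / PLOTS_PER_TOTAL - 2 * correction
ss_all = sum(y * y for row in table for y in row) / PLOTS_PER_CELL - correction

print([round(x, 1) for x in i_tot], [round(x, 1) for x in j_tot])
print(round(ss_cols, 2), round(ss_rows, 2), round(ss_inter, 2), round(ss_all, 2))
```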

The error mean square (24 d.f.) was 6.300. Thus there is no evidence
of any interaction, and the effects of the fertilizers in the two years may be
regarded as additive. The standard error of a marginal total is $\sqrt{12 \times 6.300}$
or ± 8.70. Consequently the response to artificials applied in 1935 is significantly
. . . in 1934 show no residual effect,
whereas that of compost is significant and large.




10 . Confounding with three and four factors each at three levels. 

Both 3x3x3 and 3X3X3X3 experiments can be arranged in blocks of 
9 plots or in 9 X 9 Latin squares, confounding only three-factor interactions. 
These designs are of considerable practical importance in agriculture, and we 
will therefore describe them in detail. 

10a. 3 x 3 x 3 designs in blocks of 9 plots.

There are 8 degrees of freedom for the three-factor interactions. These 
can be divided into four orthogonal pairs, each pair being given by the contrasts 
of the sums of three sets of nine plots each. The four groups of three sets are 
given in Table 43, being represented by the four letters W, X, Y, Z.*


Table 43. 3 x 3 x 3 designs confounding three-factor interactions.

Combination of first                       Level of third factor
and second factors        W1  W2  W3     X1  X2  X3     Y1  Y2  Y3     Z1  Z2  Z3

    1    00                0   2   1      0   1   2      0   2   1      0   1   2
    2    10                1   0   2      2   0   1      1   0   2      2   0   1
    3    20                2   1   0      1   2   0      2   1   0      1   2   0
    4    01                2   1   0      1   2   0      1   0   2      2   0   1
    5    11                0   2   1      0   1   2      2   1   0      1   2   0
    6    21                1   0   2      2   0   1      0   2   1      0   1   2
    7    02                1   0   2      2   0   1      2   1   0      1   2   0
    8    12                2   1   0      1   2   0      0   2   1      0   1   2
    9    22                0   2   1      0   1   2      1   0   2      2   0   1


Examination of the table will show that every combination of each pair of
factors occurs in each set of 9 plots, and consequently if these sets are arranged
in blocks the main effects and two-factor interactions will be unconfounded.
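The property just stated can be verified mechanically. The sketch below regenerates the twelve sets of Table 43 from arithmetic modulo 3 and checks the balance of every pair of factors. The modular rules in `RULES` are an observation about the entries of the table, introduced here as an assumption for illustration; they are not a notation used in the text.

```python
# Level c of the third factor as a function of the levels a, b of the first
# two factors and the set index k (k = 0, 1, 2 for sets 1, 2, 3).
RULES = {
    "W": lambda a, b, k: (a - b - k) % 3,
    "X": lambda a, b, k: (2 * a + b + k) % 3,
    "Y": lambda a, b, k: (a + b - k) % 3,
    "Z": lambda a, b, k: (2 * a + 2 * b + k) % 3,
}


def make_set(group, k):
    """Nine plots (a, b, c) of one set, in the standard order 00, 10, 20, 01, ..."""
    return [(a, b, RULES[group](a, b, k)) for b in range(3) for a in range(3)]


def pairs_balanced(plots):
    """True if every combination of each pair of factors occurs exactly once."""
    return all(
        len({(p[i], p[j]) for p in plots}) == 9
        for i, j in [(0, 1), (0, 2), (1, 2)]
    )


for group in RULES:
    for k in range(3):
        levels = [c for _, _, c in make_set(group, k)]
        print(f"{group}{k + 1}", levels, pairs_balanced(make_set(group, k)))
```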

If more than one replication is available it is best to use different groups
for the different replications, thus partially confounding some or all of the
three-factor interactions. If four replications are used complete balance is
attained, and 3/4 of the relative information will be available on all the three-factor
interactions. Partial confounding introduces some additional complication into
the computations, unless the partially confounded degrees of freedom are allowed
to remain in the estimate of error, but the difficulties are not great if the method
described below is systematically followed.

10b. Example of a 3 x 3 x 3 design.

Table 44 gives the plan and yields of sugar in an experiment on sugar beet
(Woburn, 1935) in which all combinations of three sowing dates, April 10 (d0),†
May 9th (d1), May 25th (d2), three spacings of rows, . . .
20 in. (s2), and three levels of sulphate of ammonia, nothing (n0), 0.3 cwt. N per
acre (n1), and 0.6 cwt. N per acre (n2), were included. The experiment was
arranged in 6 blocks of 9 plots each. Since after rejection of edge rows the
plots of the three spacings were of different area the yields have been converted
to cwt. per acre before analysis.*

*These groups have previously been numbered I, II, III and IV in various orders, but no consistent notation
has been established.
† . . . sowing, March 14th, failed and . . . replaced this.


Table 44. Plan and yields of sugar (cwt. per acre).


Y: 

425.8 


Y, 

439-9 


Y, 

359-0 


62 

52-2 

90 

3*-3 

5* 

52-7 

20 

36-4 

20 

47-8 

11 

39-4 

90 

35-2 

61 

35-3 

82 

45-4 

32 

29.9 

40 

44-6 

81 

34-4 

3* 

46.0 I 

40 

33-3 

7* 

5*-4 

52 

33-6 

12 

SO-S 

7* 

31-9 

4» 

49-7 

12 

33-6 

10 

47-8 

82 

3* 4 

32 

44-* 1 

70 

25-7 

21 

52-S 

50 

33-0 

5* 

49-3 

4* 

36.6 

9* 

46.2 

62 

4*-4 

60 

47-* , 

9* 

37-6 

80 

47-2 

30 

33-2 

7* 

56.0 

21 

41.8 

4^ 

50.9 

80 

32.4 

61 

38.2 

92 

37-7 

22 

43-0 

10 

39-4 

81 

36.5 

22 

43-1 

30 

38.0 

7* 

34-9 

II 

45-7 

5* 

34-2 

SO 

37-* 

42 

36.0 

92 

34-2 


33-5 

70 

35-4 

3> 

26.6 


Zz 

305-5 


Z, 

3H-3 


Z, 

3*7-8 


The combination of the first two factors, d and s, on each plot is given by the first figure, and
the level of the third factor, n, by the second figure.


The various steps in the analysis of an experiment of this type are as follows.
The order given should be adhered to, so that errors may be detected before
the erroneous values are used in extensive further calculations.

1. Identify the blocks with the groups and sets given in Table 43, or
check the numbering if this is given.

2. Set out the totals of the separate treatment combinations in the order
shown in Table 45 (with the third factor
uppermost). This should be done even if there is only a single replication.


*This accounts for slight differences between the results given here and those in the Rothamsted Report.


Table 45. Yields of separate treatment combinations.

               n0                     n1                     n2
         s0    s1    s2         s0    s1    s2         s0    s1    s2

d0     87.2  77.9  61.1       85.1  86.3  86.3       84.1  86.9  87.9
d1     84.2  70.1  79.6       94.3  86.9  70.9       86.1  82.9  76.8
d2     71.2  80.6  66.5       72.6  73.5  83.8       74.0  93.6  71.9


3. Calculate the total sum of squares of all the yields of Table 44, the
correction for the mean (which should be checked), the sum of squares for
blocks, and the total sum of squares for treatments from Table 45. The block
totals are obtained in the course of this calculation, together with a check on
the total and on the formation of Table 45 (see Note 4).

4. Calculate the five 3 x 3 tables given in Table 46a. The first three
require no comment. The last two give the diagonal totals [I] and [J] for the
3 x 3 tables for each level of n of Table 45. Marginal totals need not be taken
out at this stage.


Table 46. Calculation of main effects and interactions.

(a) Two-way tables.

            s0       s1       s2      Total
d0        256.4    251.1    235.3     742.8
d1        264.6    239.9    227.3     731.8
d2        217.8    247.7    222.2     687.7

            n0       n1       n2      Total
d0        226.2    257.7    258.9
d1        233.9    252.1    245.8
d2        218.3    229.9    239.5

s0        242.6    252.0    244.2     738.8
s1        228.6    246.7    263.4     738.7
s2        207.2    241.0    236.6     684.8

I1        223.8    255.8    238.9
I2        225.9    254.1    267.6
I3        228.7    229.8    237.7

J1        247.4    229.5    254.5
J2        228.6    264.4    244.9
J3        202.4    245.8    244.8

Total     678.4    739.7    744.2    2162.3

(b) Three-factor interactions.

             1        2        3      Total      Mean
[W]        715.6    694.6    752.1    2162.3
[X]        721.2    719.4    721.7

[Y]        756.6    728.9    676.8
Blocks     439.9    425.8    359.0    1224.7
[Y]'       316.7    303.1    317.8     937.6    312.53

[Z]        738.1    702.9    721.3
Blocks     317.8    305.5    314.3     937.6
[Z]'       420.3    397.4    407.0    1224.7    408.23

Standard errors. Totals of 6 : ± 8.97 ; totals of 18 : ± 15.54.






6. Calculate the sums of squares corresponding to the nine values in each
of the first three tables of Table 46a. These are shown in Table 47. The first
table, for instance, gives the sum of D, S, and D.S. One set of marginal totals
of each of these three tables may be obtained in the course of this calculation.


Table 47. Auxiliary table of treatment sums of squares (ignoring confounding).

Correction for mean                           86584.10
Correction for working mean (200)              2430.76

D, S, D.S                                       341.52
D, N, D.N                                       275.13
S, N, S.N                                       329.77
D                                                94.47
S                                               107.80
N                                               150.14
D.S.N : unconfounded, W, X                       94.22
D.S.N : partially confounded, Y, Z              216.84

All treatments                                  905.06


7. Calculate the sums of squares for the main effects from these marginal
totals (checking that the total of each set is correct), and enter these in Table 47.

8. Calculate the sums of squares for the four pairs of degrees of freedom 
for the three-factor interactions, keeping separate the unconfounded and partially 
confounded degrees of freedom, and enter these in Table 47. 

9. Subtract the sums of squares for main effects from the sum of all the
other treatment items of Table 47. This should give the total treatment sum
of squares and assures the correctness of all of the preceding calculations which
involve treatments.

10. Check the sum of squares for blocks and the total sum of squares. 

If there were no confounding, or if one pair of degrees of freedom were 
completely confounded, the final analysis of variance table could now be 
immediately prepared. With partial confounding, however, the following 
additional steps are necessary. 

11. Enter the block totals corresponding to the confounded pairs of degrees
of freedom in the proper order in Table 46b, subtract these from the full totals,
[Y] and [Z], thus obtaining the totals [Y]' and [Z]', which include only those
blocks in which Y and Z respectively are not confounded. (If there is any
doubt about this process, check one or more of the values by direct totalling
over the blocks in which the degrees of freedom concerned are not confounded.)
Calculate the sum of squares from these new totals and enter in Table 47.
(Note that each set has a different total and therefore a different correction for
the mean, and that a new divisor, here 9, is required, since only 9 plots are
included in each total.) The whole of this calculation must be checked, particular
attention being paid to seeing that the block totals are entered in their correct places.

12. Construct the final analysis of variance table shown in Table 48. 




Table 48. Analysis of variance.

                                  D.F.    Sum of squares    Mean square       z
Correction for mean                          86584.10

Blocks                              5         1950.38          390.08
D                                   2           94.47           47.24       0.629*
S                                   2          107.80           53.90       0.695*
N                                   2          150.14           75.07       0.861*
D.S                                 4          139.25           34.81       0.477
D.N                                 4           30.52            7.63
S.N                                 4           71.83           17.96       0.146
D.S.N : unconfounded                4           94.22 }
D.S.N : partially confounded        4           44.29 }         17.31       0.127
Error                              22          295.29           13.42

Total                              53         2978.19


13. Construct the various summaries of results. Tables for main effects
and two-factor interactions and their standard errors can be obtained directly
by conversion of the first three tables of Table 46. The conversion factor is
here 1/6.

In this experiment the reduction in error variance by the arrangement in
blocks is very large. Although much of this reduction results from the difference
between the two replications, the further reduction due to the use of blocks
of 9 instead of 27, made possible by confounding, is also substantial, the gain
in information, estimated by the method of Section 7b, being 53.1 per cent.
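The three-factor interaction totals and the sums of squares entered for them can be reproduced from Table 45 by the following sketch. The function `set_index` and its modular rules are assumptions introduced for illustration (they reproduce the sets of Table 43); the block totals are those of Table 46b.

```python
from itertools import product

# Table 45: yields[n][d][s], each a total of two plots.
yields = [
    [[87.2, 77.9, 61.1], [84.2, 70.1, 79.6], [71.2, 80.6, 66.5]],   # n0
    [[85.1, 86.3, 86.3], [94.3, 86.9, 70.9], [72.6, 73.5, 83.8]],   # n1
    [[84.1, 86.9, 87.9], [86.1, 82.9, 76.8], [74.0, 93.6, 71.9]],   # n2
]


def set_index(group, d, s, n):
    """Index (0, 1, 2) of the set of Table 43 containing combination d, s, n."""
    rule = {"W": d - s - n, "X": n - 2 * d - s, "Y": d + s - n, "Z": n - 2 * d - 2 * s}
    return rule[group] % 3


totals = {g: [0.0, 0.0, 0.0] for g in "WXYZ"}
for n, d, s in product(range(3), repeat=3):
    for g in "WXYZ":
        totals[g][set_index(g, d, s, n)] += yields[n][d][s]

grand = sum(totals["W"])
correction = grand ** 2 / 54
# W and X are unconfounded: totals of 18 plots.
ss_unconfounded = sum(
    sum(t * t for t in totals[g]) / 18 - correction for g in "WX"
)

# Y and Z: deduct the totals of the blocks in which each is confounded.
blocks = {"Y": [439.9, 425.8, 359.0], "Z": [317.8, 305.5, 314.3]}
reduced = {g: [t - b for t, b in zip(totals[g], blocks[g])] for g in "YZ"}
ss_partial = sum(
    sum(t * t for t in reduced[g]) / 9 - sum(reduced[g]) ** 2 / 27 for g in "YZ"
)

for g in "WXYZ":
    print(g, [round(t, 1) for t in totals[g]])
print(round(ss_unconfounded, 2), round(ss_partial, 2))
```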


10c. Adjusted yields of three-factor combinations.

Under ordinary circumstances it will not be necessary to construct any
table including all three factors, but should this be required it may best be
done in two stages :

(a) assuming the three-factor interactions to be negligible ;

(b) introducing correcting terms for these interactions.

The general rule for obtaining any value of stage (a) is to add
the appropriate values of the converted two-way tables for the two-factor
interactions, deducting the corresponding marginal means the number
of times they are involved (i.e. once with three factors, twice with four factors,
etc.) and adding the requisite multiple of the general mean. In the present
example :

$d_0 s_0 n_0 = 42.73 + 37.70 + 40.43 - 41.27 - 41.04 - 37.69 + 40.04 = 40.90,$

42.73 being 1/6 of 256.4 and 41.27 being 1/18 of 742.8, etc.

The correcting terms for the three-factor interactions are
obtainable from Table 46b by multiplying [W] and [X] by the conversion
factor for 18 plots (here 1/18) and [Y]' and [Z]' by the conversion factor for
9 plots (here 1/9). Since d0s0n0 occurs in W1, X1, Y1 and Z1, the adjusted yield is

$40.90 + 39.76 + 40.07 + 35.19 + 46.70 - 4 \times 40.04 = 42.46,$

the mean of the means of Y' and Z' being equal to the general mean.
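The two stages can be followed numerically in a short sketch for the combination d0s0n0, using totals from Table 46. The variable names are hypothetical. Working with unrounded means gives 42.45 rather than 42.46, a difference due only to the rounding of the separate terms.

```python
# Totals from Table 46a involving d0, s0 and n0.
two_way_totals = {"d0s0": 256.4, "d0n0": 226.2, "s0n0": 242.6}   # totals of 6 plots
marginal_totals = {"d0": 742.8, "s0": 738.8, "n0": 678.4}        # totals of 18 plots
grand_total = 2162.3                                             # 54 plots
general_mean = grand_total / 54

# Stage (a): three-factor interactions assumed negligible.
stage_a = (
    sum(t / 6 for t in two_way_totals.values())
    - sum(t / 18 for t in marginal_totals.values())
    + general_mean
)

# Stage (b): correcting terms from Table 46b for the sets W1, X1, Y1 and Z1.
w1, x1 = 715.6, 721.2            # totals of 18 plots
y1_reduced, z1_reduced = 316.7, 420.3   # [Y]' and [Z]', totals of 9 plots
stage_b = stage_a + w1 / 18 + x1 / 18 + y1_reduced / 9 + z1_reduced / 9 - 4 * general_mean

print(round(stage_a, 2), round(stage_b, 2))
```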





Alternatively corrections may be applied to the individual plot yields so
as to eliminate the block effects, as in Section 4f. These are derived from
Table 46b, that for block Y1, for instance, being

$\tfrac{1}{9}\,(316.7 - 312.53 - 439.9 + 408.23) = -3.06.$

Similarly that for block Z1 is + 0.76, and consequently the adjusted yield of
d0s0n0 is (from Table 45)

$\tfrac{1}{2}\,(87.2 - 3.06 + 0.76) = 42.45.$

To prevent the accumulation of small errors and facilitate checking it is best 
to retain an additional figure in this calculation, as shown. When the whole 
table is required the computation can be shortened in various ways, the details 
of which may be left to the reader. 

The standard errors of the various differences can be obtained by considering
which of the interaction effects W, X, Y and Z are involved, remembering that
each difference is made up of the sum of 9 components, representing main
effects, and two- and three-factor interactions. Thus d1s1n1 and d0s0n0 occur
in the same Z set, but in different W, X, and Y sets. The relative information
on Y is 1/2 and consequently the variance as ordinarily calculated must be
increased in the ratio

$(\tfrac{8}{9} + \tfrac{1}{9} \times 2) : 1 = 10 : 9.$

Similarly d1s0n0 and d0s0n0 occur in different W, X, Y and Z sets, so that
the variance of their difference must be increased in the ratio 11 : 9. Had there
been four replications, with 3/4 information on W, X, Y and Z, the ratios would
have been 10 : 9 and 31 : 27 respectively.
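These ratios can be checked by enumeration. The sketch below treats the 26 treatment degrees of freedom as 13 pairs, each represented by a coefficient vector modulo 3, and averages the reciprocal of the relative information over the 9 pairs in which two combinations differ. The representation, the identification of W, X, Y and Z with particular vectors, and the function name are all assumptions made for illustration.

```python
from fractions import Fraction
from itertools import product

# The 13 pairs of degrees of freedom, as coefficient vectors (mod 3) whose
# first non-zero coefficient is 1.
PENCILS = [
    h for h in product(range(3), repeat=3)
    if any(h) and next(x for x in h if x) == 1
]

# Assumed identification of the groups of Table 43 with coefficient vectors.
GROUPS = {(1, 2, 2): "W", (1, 2, 1): "X", (1, 1, 2): "Y", (1, 1, 1): "Z"}


def variance_ratio(t1, t2, info):
    """Ratio by which the variance of the difference t1 - t2 must be increased."""
    differing = [
        h for h in PENCILS
        if sum(x * (p - q) for x, p, q in zip(h, t1, t2)) % 3
    ]
    total = sum(
        Fraction(1) / info.get(GROUPS.get(h), Fraction(1)) for h in differing
    )
    return total / len(differing)


two_reps = {"W": Fraction(1), "X": Fraction(1), "Y": Fraction(1, 2), "Z": Fraction(1, 2)}
four_reps = {g: Fraction(3, 4) for g in "WXYZ"}

# Combinations written as (d, s, n).
print(variance_ratio((1, 1, 1), (0, 0, 0), two_reps))    # same Z set
print(variance_ratio((1, 0, 0), (0, 0, 0), two_reps))    # different W, X, Y, Z sets
print(variance_ratio((1, 0, 0), (0, 0, 0), four_reps))
```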

The calculation of separate components of the three-factor interactions is 
discussed in the next section. 


10d. 3 x 3 x 3 x 3 designs in blocks of 9 plots.

Designs with four factors (but not more) at three levels can be arranged 
in blocks of 9 plots in a similar manner to designs with three factors, confounding 
only three-factor interactions. Consequently, if 81 plots are available, the 
possibility of including an additional factor should always be borne in mind, 
since this entails no loss of accuracy owing to increase in block size and little 
additional complication in the computations. 

There are 32 degrees of freedom for three-factor interactions. These can
be divided in various ways into 4 groups of 8 degrees of freedom each, in such
a manner that each group of 8 degrees of freedom is given by the contrasts of
9 sets of 9 treatment combinations. One such group of sets is shown in Table 49.
In this table the combinations of the third and fourth factors are also represented
by the numbers 1-9. Thus the fourth combination of the second set of the
first group is 47, which represents the combination a0b1c0d2.
The table is used in an exactly similar manner to Table 43.

The analysis of variance follows the same lines as that of the 3 x 3 x 3
design. With a single replication, however, it is scarcely worth while
computing every item of the analysis of variance separately. The sums
of squares for the main effects and two-factor interactions may be calculated
from two-way tables in the ordinary manner. The three-factor interactions
between any set of three factors which are judged to be of interest may also
be eliminated from the estimate of error if desired. A pair of degrees of freedom
out of any such set of 8 is confounded with blocks.


Table 49. Set of 3^4 designs confounding three-factor interactions.

Combination of first              Combination of third and fourth factors
and second factors                   I                        II

    1    00                  1 5 9 8 3 4 6 7 2        1 9 5 6 2 7 8 4 3
    2    10                  5 9 1 3 4 8 7 2 6        9 5 1 2 7 6 4 3 8
    3    20                  9 1 5 4 8 3 2 6 7        5 1 9 7 6 2 3 8 4
    4    01                  6 7 2 1 5 9 8 3 4        8 4 3 1 9 5 6 2 7
    5    11                  7 2 6 5 9 1 3 4 8        4 3 8 9 5 1 2 7 6
    6    21                  2 6 7 9 1 5 4 8 3        3 8 4 5 1 9 7 6 2
    7    02                  8 3 4 6 7 2 1 5 9        6 2 7 8 4 3 1 9 5
    8    12                  3 4 8 7 2 6 5 9 1        2 7 6 4 3 8 9 5 1
    9    22                  4 8 3 2 6 7 9 1 5        7 6 2 3 8 4 5 1 9

Confounded degrees           A.B.C (W), A.B.D (Y)     A.B.C (X), A.B.D (Z)
of freedom                   A.C.D (Z), B.C.D (X)     A.C.D (W), B.C.D (Y)

                                     III                      IV

    1    00                  1 6 8 9 2 4 5 7 3        1 8 6 5 3 7 9 4 2
    2    10                  6 8 1 2 4 9 7 3 5        8 6 1 3 7 5 4 2 9
    3    20                  8 1 6 4 9 2 3 5 7        6 1 8 7 5 3 2 9 4
    4    01                  9 2 4 5 7 3 1 6 8        5 3 7 9 4 2 1 8 6
    5    11                  2 4 9 7 3 5 6 8 1        3 7 5 4 2 9 8 6 1
    6    21                  4 9 2 3 5 7 8 1 6        7 5 3 2 9 4 6 1 8
    7    02                  5 7 3 1 6 8 9 2 4        9 4 2 1 8 6 5 3 7
    8    12                  7 3 5 6 8 1 2 4 9        4 2 9 8 6 1 3 7 5
    9    22                  3 5 7 8 1 6 4 9 2        2 9 4 6 1 8 7 5 3

Confounded degrees           A.B.C (Z), A.B.D (W)     A.B.C (Y), A.B.D (X)
of freedom                   A.C.D (X), B.C.D (W)     A.C.D (Y), B.C.D (Z)


If the totals of the blocks of any grouping (taken in the order shown) are
arranged in a two-way table in the standard order (Table 40), the . . .
totals give the confounded degrees of freedom . . .
the I totals A.B.D, and the J totals . . . The actual pairs confounded are
given in Table 49 ; . . .
of the sets of totals, [W], [X], [Y] or [Z] . . . contains
whole blocks in each total instead of three plots from each block. If no three-
factor interactions are eliminated there will be 40 degrees of freedom for error ;
if all are eliminated there will be 16 degrees of freedom for error.

10e. 3^3 and 3^4 designs in quasi-Latin squares.

The sets given for confounding in blocks of 9 plots can also be used to arrange
these designs in 9 x 9 quasi-Latin squares, only three-factor interactions being confounded.
Arrangements of this type are shown in Tables 50 and 51. Rows and columns
must be randomized as usual. Partial confounding could be adopted in the 3^3
design but is scarcely worth while in a single square, since . . . the relative
information must be sacrificed on two of the pairs of degrees of freedom.


Table 50. 3 x 3 x 3 design in a 9 x 9 quasi-Latin square.

10 21 32 41 52 60 72 80 91
21 32 10 52 60 41 91 72 80
32 10 21 60 41 52 80 91 72
42 61 50 92 81 70 30 11 22
50 42 61 70 92 81 11 22 30
61 50 42 81 70 92 22 30 11
71 90 82 12 31 20 40 62 51
82 71 90 20 12 31 62 51 40
90 82 71 31 20 12 51 40 62

Confounded degrees of freedom : rows, Y ; columns, W (Table 43).



Table 51. 3^4 design in a 9 x 9 quasi-Latin square.

11 29 35 48 54 63 76 82 97
28 34 13 56 62 47 81 99 75
36 12 27 61 49 55 98 74 83
45 51 69 73 88 94 17 26 32
53 68 44 87 96 72 25 31 19
67 46 52 95 71 89 33 18 24
79 85 91 14 23 38 42 57 66
84 93 78 22 37 16 59 65 41
92 77 86 39 15 21 64 43 58

Confounded degrees of freedom : rows, II ; columns, IV (Table 49).


10f. Extension to 3^n in blocks of 3^(n-1) or 3^(n-2).

If in Table 43 we replace each level of the third factor by a set of three
combinations of a third and a fourth factor, such that, in the previous notation,
0 = 1 + 5 + 9, 1 = 2 + 6 + 7, 2 = 3 + 4 + 8 (the I sets), then the contrast of
W1, W2 and W3, etc., will represent a pair of degrees of freedom from the four-
factor interactions A.B.C.D. If the J sets are used, then another pair of degrees
of freedom will be obtained. Thus all the 16 degrees of freedom will be obtained
in pairs. We are consequently provided with a set of designs for confounding
a 3^4 design in blocks of 27 plots.

This process may be continued indefinitely, and a similar process may be
. . .


11. The subdivision of sets of degrees of freedom.

11a. Subdivision of main effects.

If the response to a fertilizer is proportional to the amount of the fertilizer
applied, i.e. if the response curve is a straight line, and if the fertilizer is applied
at three levels, equally spaced, the response per unit dressing will be estimated
from the difference of the two extreme values. Moreover in such a case the
yield of the central dressing will be equal to the mean of the yields of the two
extreme dressings, except for experimental error, and consequently the observed
difference of these two quantities may be taken as a measure of the
curvature of the response curve.

We may thus divide the two degrees of freedom for a fertilizer, n say, at
three levels into two single degrees of freedom, one representing the average
or linear component of the response and the other the curvature. These
quantities may be denoted by N' and N'', defined as

$N' = n_2 - n_0$

$N'' = n_2 - 2n_1 + n_0$

N' is therefore the response to the double dressing, and N'' the difference between
the responses to the second and to the first dressing.*

The sums of squares corresponding to N' and N'' are given by

$\frac{1}{2n}[N']^2$ and $\frac{1}{6n}[N'']^2$

respectively ($6 = 1^2 + 2^2 + 1^2$), where n is the number of plots contributing to
[n0], etc. The standard errors are $\sqrt{2n}$ and $\sqrt{6n}$ times the standard error of
a single plot. The two degrees of freedom are orthogonal, and consequently
the two sums of squares total to the sum of squares for the two degrees of
freedom.

If the response is substantially linear over the range . . . the sum
of squares for N' will be much greater than that for N'' and it may well be that
N' attains significance although the sum of squares for the two degrees of
freedom fails to do so, owing to the diluting effect of N''. The test of N' alone
is always legitimate, and should be made when the two degrees together fail to
attain significance and inspection of the results . . .
The experimenter who confines his attention to the two degrees . . .

Thus in the example just given the mean square for sowing . . .
and nitrogen in a similar manner. . . . This illustrates the high
. . . all accurately . . .
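As an illustration of this subdivision, the sketch below applies the definitions of N' and N'' to the nitrogen totals of Table 46a (678.4, 739.7 and 744.2, each a total of 18 plots). The function name is hypothetical.

```python
def linear_and_curvature(t0, t1, t2, n_plots):
    """Return (N', N'', SS for N', SS for N'') from three equally spaced level totals."""
    linear = t2 - t0                   # N'  = n2 - n0
    curvature = t2 - 2 * t1 + t0       # N'' = n2 - 2 n1 + n0
    ss_linear = linear ** 2 / (2 * n_plots)
    ss_curvature = curvature ** 2 / (6 * n_plots)   # 6 = 1^2 + 2^2 + 1^2
    return linear, curvature, ss_linear, ss_curvature


n_lin, n_curv, ss_lin, ss_curv = linear_and_curvature(678.4, 739.7, 744.2, 18)
print(round(n_lin, 1), round(n_curv, 1))
print(round(ss_lin, 2), round(ss_curv, 2), round(ss_lin + ss_curv, 2))
```

The two single-degree-of-freedom sums of squares add to 150.14, the sum of squares for N in Table 48, as the orthogonality of the two components requires.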





Similar divisions can be made when other types of treatment are involved.
Thus in the experiment given in Table 41 the two degrees of freedom for 1933
and 1935 treatments might be divided into two single degrees of freedom, one
representing the response to fertilizers, i.e. the mean of artificials and compost,
and the other the difference between artificials and compost. Note, however,
that if the single degrees of freedom were chosen to represent the response to
artificials and the response to compost, these would not be orthogonal, and
consequently the corresponding sums of squares, although each would in itself
give rise to a z test of significance identical with the t test, would not add up
to the total sum of squares for this set of treatments. There is no reason why
the separate comparisons considered should always correspond to orthogonal
degrees of freedom, but this will most frequently be the case in well designed
experiments.

Sets of three or more degrees of freedom can be divided in a similar manner. 
There are many possible alternatives, which we have not the space to discuss 
here. The point to remember about all such subdivisions is that to be useful 
they must correspond to some reasonable simplification of the treatment effects, 
e.g. that forms of nitrogen are equivalent, that the response curve to a fertilizer 
can be reasonably represented by a straight line, or a second degree curve, etc. 
Whether such simplifications are in fact contradicted by the data can then be 
rigorously tested. 


11b. Subdivision of interactions.

Corresponding to any given subdivision of the degrees of freedom for the
main effects of a factor, there exists a corresponding subdivision of the associated
interaction degrees of freedom. Thus in the previous example the four degrees
of freedom for the interactions between sowing dates and spacings may be
subdivided into the interaction of the linear responses D'.S', the interactions
of each linear response with the other curvature, D'.S'' and D''.S', and the
interaction of the curvatures D''.S''. D'.S', for example, indicates the linear
. . . varies . . . response to d, or alternatively
. . .




Table 52. Expressions for interactions of a 3 x 3 table.

             A'.B'              A''.B'              A'.B''             A''.B''
         b0   b1   b2       b0   b1   b2       b0   b1   b2       b0   b1   b2
a0       +1    0   -1       -1    0   +1       -1   +2   -1       +1   -2   +1
a1        0    0    0       +2    0   -2        0    0    0       -2   +4   -2
a2       -1    0   +1       -1    0   +1       +1   -2   +1       +1   -2   +1

Divisor      4n                 12n                 12n                36n

n being the number of plots included in each total of the 3 x 3 table. As usual
the divisors required to give the interactions in units of a single plot yield are
one-half the above divisors, and the multipliers of the error mean square required
to give the error variances of the totals are equal to the above divisors.

Applying the above multipliers to the d and s table of the previous example,
we obtain the results of Table 53.


Table 53. Numerical values of interactions.

Interaction          Total             cwt. per acre      Sum of squares
D'.S'            + 25.5 ± 17.9        + 2.12 ± 1.49           27.09
D''.S'           + 57.9 ± 31.1        + 1.61 ± 0.86           46.56
D'.S''           - 44.9 ± 31.1        - 1.25 ± 0.86           28.00
D''.S''          - 90.1 ± 53.8        - 0.83 ± 0.50           37.58

                                                             139.23


A systematic method of arriving at the above totals, and also at those for
the corresponding main effects, is shown in Table 54.

Table 54. Computation of main effects and interactions of a 3 x 3 table.

              (1)                            (2)                          Key
    738.8    738.7    684.8      2162.3    -54.0    -53.8       Total    S'        S''
    -38.6     -3.4    -13.1       -55.1    +25.5    -44.9       D'       D'.S'     D'.S''
    -55.0    +19.0     +2.9       -33.1    +57.9    -90.1       D''      D''.S'    D''.S''

In the first three columns (1) the first line represents the totals of the three
columns of the d and s table (Table 46), the second line the differences d2 - d0
of these columns, and the third line the quantity d2 - 2d1 + d0 for each column.
Each number need only be written on the machine once, the sequence being :

    + 217.8
    - 256.4              giving  - 38.6
    + 256.4 x 2
    - 264.6 x 2          giving  - 55.0
    + 264.6 x 3          giving   738.8

The computer must learn to read negative numbers directly from the machine. 

A second application of the same process to the rows of (i) gives the required 
quantities (2) in the order shown. 
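The double application of the process can be sketched in Python as follows, starting from the d and s table of Table 46a. The function name `components` and the arrangement of the code are hypothetical; the divisors are those of Table 52 with n = 6.

```python
# d and s table (Table 46a): rows d0, d1, d2; columns s0, s1, s2; totals of 6 plots.
ds = [
    [256.4, 251.1, 235.3],
    [264.6, 239.9, 227.3],
    [217.8, 247.7, 222.2],
]
N_PLOTS = 6


def components(x0, x1, x2):
    """Total, linear component and curvature of three equally spaced values."""
    return [x0 + x1 + x2, x2 - x0, x2 - 2 * x1 + x0]


# Stage (1): apply the process down each column of the d and s table.
by_column = [components(*(ds[d][s] for d in range(3))) for s in range(3)]
stage1 = [[by_column[s][r] for s in range(3)] for r in range(3)]   # lines: Total, D', D''

# Stage (2): apply the same process along each line of stage (1).
stage2 = [components(*line) for line in stage1]

# Sums of squares of the four interaction components.
divisors = {(1, 1): 4, (1, 2): 12, (2, 1): 12, (2, 2): 36}
interaction_ss = {
    key: stage2[key[0]][key[1]] ** 2 / (mult * N_PLOTS)
    for key, mult in divisors.items()
}

for line in stage2:
    print([round(v, 1) for v in line])
print(round(sum(interaction_ss.values()), 2))
```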

11d. General remarks.

The method used in the above example is perfectly general, and can be
used to subdivide the interactions in a manner corresponding to that adopted
for the main effects. If the main effects are orthogonally divided then the
interactions will also be orthogonally divided. Moreover there is no need to
subdivide into single degrees of freedom. Thus if the factor a represents three
varieties and b three levels of a fertilizer, we may subdivide into two pairs of
degrees of freedom A.B' and A.B'' : the former will be given by the differences
between the linear responses of the three varieties, the latter by the differences
between the curvatures.

Subdivision of interaction degrees of freedom is useful, in the same manner
as was the subdivision of main effects, for throwing into prominence effects
which might otherwise escape notice. In the interaction of two fertilizers, for
example, we should expect the component A'.B' to be large compared with
the remaining components, but if the four degrees of freedom are jointly tested
its significance might be obscured. Subdivision is also useful when estimating
the error from interactions, since we may reasonably expect interactions involving
a component of curvature to be small even in cases where the component A'.B'
cannot legitimately be included in error. An example of this is provided by a
single replication of a 3 x 3 x 3 design.


12. The 3 x 3 x 3 design: single replication.

This particular design is of considerable practical importance in agricultural
fertilizer trials, for it enables the optimal levels of all three standard fertilizers
to be simultaneously investigated, and is not too large to be undertaken on
ordinary commercial farms. We will therefore analyse the first replication of the
sugar beet experiment already given, treating it as if it were the whole experiment.


12a. Systematic method of analysis.

Since experiments of this type are usually undertaken simultaneously at
a number of centres, it is advisable to adhere to some systematic method of
analysis and presentation of the results. In practice it has been found best in

[...] response to the double dressing of each factor (linear response), the
difference of the additional response to the second and [...]




Table 55. Calculation of main effects and interactions.

          s0       s1       s2      Total        W        X        Y        Z
d0      144.0    145.2    142.8     432.0      402.6    403.5    439.9    420.3
d1      143.3    139.1    129.1     411.5      396.5    415.3    425.8    397.4
d2      128.1    137.5    115.6     381.2      425.6    405.9    359.0    407.0


3. Calculate the total sum of squares for each of the first three two-way 
tables, obtaining one set of marginal totals for each of these tables in the process, 
and also the sums of squares for [Y] (blocks) and for [W], [X] and [Z]. These 
sums of squares are shown in Table 56 (blocks in Table 58). 


Table 56. Auxiliary table of sums of squares.

D, S and D.S    262.05        S, N and S.N    250.82
D, N and D.N    333.49        W, X and Z       90.36

4. Calculate the totals for the linear responses and curvatures from the
main-effect totals, and at the same time check the total of each set of main-effect
totals. The method of Section 11c may be used. Thus 381.2 - 432.0 = -50.8,
381.2 + 432.0 - 2 x 411.5 = -9.8, and the total (which need not be written
down) = 1224.7. This gives the values shown in Table 57. Then take the sum
of squares of the linear response totals, dividing by 18, and the sum of squares
of the curvature totals, dividing by 54, and enter in Table 58.
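Step 4 can be sketched in Python for the two factors whose totals are recoverable from the d and s table of Table 55. This is an illustration only: the names are hypothetical, and the totals for N are not given in that table, so they are omitted.

```python
# d x s two-way table of totals from Table 55 (lines d0..d2, columns s0..s2).
ds = [
    [144.0, 145.2, 142.8],
    [143.3, 139.1, 129.1],
    [128.1, 137.5, 115.6],
]

def linear_and_curvature(t0, t1, t2):
    """Linear response total and curvature total from three main-effect totals."""
    return t2 - t0, t0 + t2 - 2 * t1

d_totals = [sum(line) for line in ds]        # totals for d0, d1, d2
s_totals = [sum(col) for col in zip(*ds)]    # totals for s0, s1, s2

effects = {}
for name, totals in (("D", d_totals), ("S", s_totals)):
    lin, curv = linear_and_curvature(*totals)
    effects[name] = {
        "linear": round(lin, 1),
        "curvature": round(curv, 1),
        "ss_linear": lin ** 2 / 18,       # divisor 18 as in step 4
        "ss_curvature": curv ** 2 / 54,   # divisor 54 as in step 4
        "check_total": round(sum(totals), 1),
    }

for name, values in effects.items():
    print(name, values)
```

The check total should be the same (1224.7) for every set of main-effect totals, which is the check the step asks for.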


Table 57. Main effects and interactions.

                    Totals                   cwt. per acre
Factor        Linear      Curvature       Linear      Curvature
             response                    response
D             -50.8         -9.8           -5.6         -1.1
S             -27.9        -40.7           -3.1         -4.5
N             +45.4        -32.0           +5.0         -3.6
St. error     ±14.4        ±25.0           ±1.6         ±2.8
Divisor         18           54

                     Interactions
Factors         Total        cwt. per acre
D'.S'           -11.3            -1.9
D'.N'           -19.4            -3.2
S'.N'           +13.8            +2.3
St. error       ±11.8            ±2.0
Divisor           12


Table 58. Analysis of variance.

                               D.F.    Sum of squares    Mean square
Blocks                           2         415.03           207.52
Linear responses                 3         301.12
Curvatures                       3          51.42
Linear interactions              3          57.87
Other interactions (error)      15         173.79            11.59

Total                           26         999.23

If the sum of squares for blocks and those of [...] total of
squares for the linear responses and curvatures, add up [...]
squares, the whole computation up to this point may be regarded as checked.





5. Calculate the totals for the linear components of the interactions from
the cross differences of the corner values of the two-way tables, entering these
in Table 57. Thus 144.0 + 115.6 - 128.1 - 142.8 = -11.3. Divide the sum
of squares by 12 and enter in Table 58. This calculation must be carefully
checked.

6. Calculate the error sum of squares by subtraction, and complete Table 58.
Enter the standard errors of the totals in Table 57, e.g. √(18 x 11.59) = ±14.4.

Then convert the values of Table 57 to the proper units. Here the conversion
factor for the linear responses and the curvatures is 1/9 and for the interactions
is 1/6, since the yields of the single plots are already in cwt. per acre.
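Steps 5 and 6 can be sketched in Python for the D'.S' interaction. This is an illustration under stated assumptions: the names are hypothetical, the standard error of a total is taken as the square root of (divisor x error mean square), which reproduces the printed ±14.4, and the unit conversion factors 1/9 and 1/6 are inferred from the values printed in Table 57.

```python
import math

ERROR_MS = 11.59  # error mean square from Table 58

# Step 5: cross difference of the corner values of the d x s table (Table 55).
d0s0, d0s2, d2s0, d2s2 = 144.0, 142.8, 128.1, 115.6
ds_linear = d0s0 + d2s2 - d2s0 - d0s2
ss_ds_linear = ds_linear ** 2 / 12        # divisor 12 for a linear interaction

# Step 6: standard error of a total with a given divisor.
def se_total(divisor):
    return math.sqrt(divisor * ERROR_MS)

se_linear = se_total(18)       # linear response totals
se_curvature = se_total(54)    # curvature totals
se_interaction = se_total(12)  # linear interaction totals

# Conversion to cwt. per acre (assumed factors 1/9 and 1/6, see lead-in).
d_linear_cwt = -50.8 / 9
ds_linear_cwt = ds_linear / 6

print(round(ds_linear, 1), round(ss_ds_linear, 2))
print(round(se_linear, 1), round(se_curvature, 1), round(se_interaction, 1))
print(round(d_linear_cwt, 1), round(ds_linear_cwt, 1))
```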

This completes the analysis. Tests of significance can be made in the 
ordinary manner by the t test. The linear responses to change of sowing date and 
nitrogen are significant but that to spacing is barely so. The error mean square 
11.59 agrees well with that already found from the analysis of the whole 
experiment. 

12b. Alternative method.

An alternative method of analysis is to obtain all the main effects and two-
factor interactions as single degrees of freedom by the procedure illustrated in
Table 54. It will be noticed that each component of the main effects appears
in two tables. The computation can therefore be slightly abbreviated by the
omission of one set of main effects from each table. The total of each 3 x 3
table should, however, be checked.

If this procedure is adopted there is no need to compute the sums of squares
for the 3 x 3 tables shown in Table 56. The final analysis of variance will
appear in the form shown in Table 59, the whole computation being self-checking.


Table 59. Analysis of variance, alternative method.

                                  D.F.    Sum of squares       Mean square
Blocks                              2         415.03              207.52
Linear responses                    3         301.12
Curvatures                          3          51.42
Interactions: Linear x linear       3          57.87
              Linear x curv.        6          68.19 }
              Curv. x curv.         3          15.22 } 173.77      11.59
              W, X and Z            6          90.36 }

Total                              26         999.23


12c. The linear component of the three-factor interaction.

factors giveTby L LHf X 

(Section 2c) * ^ ^ components of the main eflPects 






At first sight its estimation in confounded experiments appears complicated. 
There is, however, no great difficulty, for we have the identity : 

[A'.B'.C']= h { 

as is easily verified from Table 43, or numerically from Tables 45 and 46b, 
ignoring the confounding. 

In partially confounded experiments it is only necessary to substitute the
corresponding totals, freed from confounding, which are denoted by dashes in
Table 46, multiplying these by the necessary fraction to make them the equivalent
of totals over the whole experiment. Thus in the example already given Y and
Z are partially confounded and the totals [Y'] and [Z'] include only half the
plots and must therefore be multiplied by 2. (If there were four replications,
confounding all three-factor interactions, the multiplier would be 4/3.) Thus in
our example we have:

    [D'.S'.N'] = (1/3){694.6 - 715.6 + 721.7 - 721.2 + 2(317.8 - 316.7)
                 + 2(407.0 - 397.4)} = +0.3

and the error variance is

    (1/9){18 + 18 + 18 + 18 + 2²(9 + 9 + 9 + 9)}σ² = 24σ²

Consequently, in units of a single plot yield, here cwt. per acre,

    D'.S'.N' = (1/8)(+0.3) = +0.04 ± 2.24

since there are two replications, so that [D'.S'.N'] would be the difference of
sums of 8 plots each if there were no confounding. The same result is
reached (more laboriously) by using the table of adjusted yields.

If there were no confounding the error variance of [...]
components equally confounded [...] of the relative information would be
retained.

[...] components of the three-factor interaction is completely
confounded, as must be the case in a single replication, [...]
the linear component is possible unless it is [...] remaining
components are negligible. [...] differences [W2] - [W1], etc., provides a
separate estimate of the linear component. Thus
[...] a single replication only, when Y is confounded, as in the example just
considered, we have

    A'.B'.C' = (1/4)(4/3)(1/3){[W2] - [W1] + [X3] - [X1] + [Z3] - [Z2]},

the additional factor 4/3 being introduced to compensate for the omission of one
of the four estimates, together with a further factor 1/4 to give the estimate in
units of a single plot yield. Hence

    D'.S'.N' = (1/9){396.5 - 402.6 + 405.9 - 403.5 + 407.0 - 397.4}
             = (1/9)(+5.9) = +0.66

The error variance of A'.B'.C' is now given by

    (1/81)(6 x 9)σ² = (2/3)σ²

so that the standard error of the estimate is here ±2.78. Without
confounding the error variance would be (1/16)(8)σ² = (1/2)σ². Consequently 3/4
of the relative information is retained, but, of course, only on the assumption
that the other components are negligible.

The sum of squares attributable to the corresponding single degree of
freedom is given by

    (3/2)[(1/9)(+5.9)]² = (1/54)(+5.9)² = 0.64

This can be deducted from the sum of squares for error in Table 58, leaving 
14 degrees of freedom for error. Clearly in a series of experiments this deduction 
should be either made or not made consistently : it is not permissible to perform 
the deduction only when the error is reduced thereby. 
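As a numerical check on the single-replication estimate, the arithmetic can be sketched in Python. The subscripts chosen for W, X and Z below are an assumption: they are the ones that reproduce the bracket total +5.9 from the Table 55 values. The divisors 9 and 54 are likewise taken from the printed results +0.66 and 0.64 rather than from an independently confirmed formula.

```python
# Totals of the W, X and Z sets from Table 55 (index 0, 1, 2 = lines 1, 2, 3).
w = [402.6, 396.5, 425.6]
x = [403.5, 415.3, 405.9]
z = [420.3, 397.4, 407.0]

# Assumed differences (Y is confounded with blocks and therefore omitted).
bracket = (w[1] - w[0]) + (x[2] - x[0]) + (z[2] - z[1])

estimate = bracket / 9            # in units of a single plot yield
sum_of_squares = bracket ** 2 / 54

print(round(bracket, 1), round(estimate, 2), round(sum_of_squares, 2))
```

Deducting this sum of squares from the error line of Table 58 would leave 14 degrees of freedom for error, as the text states.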

The following alternative series of expressions (for a single replication, 
Y confounded) may be noted. If 

or 

^Q-=3[A'.B'.C]^[Y,]-[Y,] 

then

    A'.B'.C' = (1/3)Q = (1/9)(3Q)

The error variance of this estimate is

    (2/3)σ²

and the sum of squares is

    (1/6)Q² = (1/54)(3Q)²

The above expressions are worth careful study. The total [A'.B'.C'], which
would form the basis of the estimate in an unconfounded experiment, is corrected
by the requisite fractions of the two [Y] block totals to eliminate block
effects, giving Q. The fractional multipliers can then all be written down, if
the relative information, here 3/4, is known, by multiplying the fractions that
would be used in an unconfounded experiment by the reciprocal of this relative
information. Thus 4/3 x 1/4 = 1/3 and 4/3 x 1/8 = 1/6. Note how 3Q is used in
place of Q in the actual computation.

This method of adjustment by means of block totals forms the basis of the
analytical methods applicable to confounded designs involving factors at both
two and three levels, which are described in the following sections.


13. Confounding with some factors at two and some at three levels.

Designs containing factors at both two and three levels cannot be so
simply confounded as those containing factors at two or at three levels only
[...] the treatment combinations [...] the interactions. The best designs are
those which [...] the confounding as much as possible to the highest order
interactions. These designs necessarily involve the partial confounding of the
more [...] fraction of [...] treatment degrees of freedom. The [...] important
interaction is, however, [...]




Designs of this type are not quite so simple to analyse as designs of the
2^n or 3^n types. The designs must be balanced, and therefore the number of
replications used must be some multiple of the number required for a balanced
arrangement. The computation is similar for all the different patterns. An
example is given for the 3 x 2 x 2 design, which will illustrate the use of the
formulas.

13a. Statistical analysis of 3 x 2 x 2 design.

Denote the three factors by A (0, 1, 2), B (0, 1), C (0, 1). Since 4 is not
a factor of 6 it is clear that the interaction B.C cannot be completely uncon-
founded if the experiment is arranged in blocks of 6 plots. The design of
Table 60 confounds B.C as little as possible.


Table 60. 3 x 2 x 2 design in blocks of 6 plots.

     Ia         Ib         IIa        IIb        IIIa       IIIb
   a b c      a b c      a b c      a b c      a b c      a b c
   0 0 1      0 0 0      0 0 0      0 0 1      0 0 0      0 0 1
   0 1 0      0 1 1      0 1 1      0 1 0      0 1 1      0 1 0
   1 0 0      1 0 1      1 0 1      1 0 0      1 0 0      1 0 1
   1 1 1      1 1 0      1 1 0      1 1 1      1 1 1      1 1 0
   2 0 0      2 0 1      2 0 0      2 0 1      2 0 1      2 0 0
   2 1 1      2 1 0      2 1 1      2 1 0      2 1 0      2 1 1


The interactions B.C and A.B.C are partially confounded with block 
differences in each replication, since the actual degree of freedom confounded 
lacks orthogonality with both these sets. In each replication the confounding 
is different, the three replications giving a balanced design which enables the 
treatment degrees of freedom B.C and A.B.C to be estimated without difficulty. 
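The structure of the design of Table 60 can be sketched in Python. This is an illustration, not the author's construction: the rule used (in replication r the level of a equal to r contributes its B.C-negative pair to block "a", the other two levels their B.C-positive pairs) is an assumption read off the table, and all names are hypothetical.

```python
from itertools import product

def bc_sign(b, c):
    """+1 for the combinations (1) and bc, -1 for b and c."""
    return 1 if b == c else -1

def block(rep, half):
    """Treatments (a, b, c) in block `half` ('a' or 'b') of replication rep = 0, 1, 2."""
    plots = []
    for a, b, c in product(range(3), range(2), range(2)):
        wanted_sign = -1 if a == rep else 1      # sign sent to block 'a'
        in_block_a = bc_sign(b, c) == wanted_sign
        if in_block_a == (half == "a"):
            plots.append((a, b, c))
    return plots

design = {(rep, half): block(rep, half) for rep in range(3) for half in "ab"}

# Every block holds each level of a twice and each level of b and c three
# times, so the main effects are free of block differences.
for plots in design.values():
    assert [sum(1 for p in plots if p[0] == lvl) for lvl in range(3)] == [2, 2, 2]
    assert sum(p[1] for p in plots) == 3 and sum(p[2] for p in plots) == 3

# Net B.C coefficient carried by each block: non-zero, hence partial confounding.
bc_net = {key: sum(bc_sign(b, c) for _, b, c in plots) for key, plots in design.items()}
print(bc_net)
```

The net coefficient is +2 in every block "a" and -2 in every block "b", which is why the block totals enter the correction of [B.C].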


13b. Statistical analysis of 3 x 2 x 2 design.

Since the interaction B.C is partially confounded it is necessary to correct
the ordinary interaction total [B.C] by means of the block totals [Ia], [Ib], etc.
If

    [Ib] - [Ia] = g1,    [IIb] - [IIa] = g2,    [IIIb] - [IIIa] = g3

and if we calculate

    3Q = 3[B.C] + g1 + g2 + g3

it can easily be verified that Q is unaffected by block differences or treatment
effects other than B.C.

The estimate of B.C in units of the yield of a single plot is given by

    B.C = (1/16)Q = (1/48)(3Q)

when there are 36 plots. The error variance of B.C is (1/8)σ². In an
unconfounded experiment the estimate and error variance would be (1/18)[B.C]
and (1/9)σ². The corresponding sum of squares is

    (1/32)Q² = (1/288)(3Q)²

as compared with (1/36)[B.C]² in an unconfounded experiment. The relative
information is given by the ratio

    (1/9) / (1/8) = 8/9





Thus 1/9 of the information is lost by the confounding when there is no reduction
in the error variance per plot.

The estimate of A.B.C is obtained in a similar manner. Calculate the
three quantities

    3R0 = 3[B.C.a0] - g1 + g2 + g3
    3R1 = 3[B.C.a1] + g1 - g2 + g3
    3R2 = 3[B.C.a2] + g1 + g2 - g3

with the check that 3R0 + 3R1 + 3R2 = 3Q.

The interaction A.B.C, in units of a single plot yield, is given by

    A.B.C = (3/10) dev (R0, R1, R2) = (1/10) dev (3R0, 3R1, 3R2)

as compared with (1/6)[B.C.a0], etc., in an unconfounded experiment. The error
variance applicable to each of these quantities is (3/5)σ², as compared with (1/3)σ².

The sum of squares is given by

    (3/20) dev² (R0, R1, R2) = (1/60) dev² (3R0, 3R1, 3R2)

The relative information is given by the ratio

    (1/3) / (3/5) = 5/9

and the relative loss of information on each of the two degrees of freedom is
therefore 4/9. Note that

    1/9 x 1 + 4/9 x 2 = 1

corresponding to the single degree of freedom confounded in each replication.
This is a property of balanced arrangements.

The reader will find it instructive to construct the above formulae by means
of the rule given at the end of the last section, using only the fractions representing
the relative information.


13c. Example.

The plan and yields of the experiment on potatoes already referred to in
Section 9d (1/65 acre plots) are given in Table 61.


Table 61. Plan and yields (lb.) of 3 x 2 x 2 experiment.


I* Ib lit 


n% 

172 

nop 

161 

1 

flip 

231 

llo 

166 

nomp 

208 

nmp 

*44 

nm 

192 

ffi 

HS 

nop 

204 

7 Mnp 

253 

nm 

190 

no 

104 

IlMp 

227 j 

nimp 

232 

ntm 
^ 3 ^ 1 

RVR 

214 

no 

**3 

mp 

131 

lix 

276 

nam 

186 

ntm 

238 

1 

198 

ns 

258 

mmp 

X71 

132 

nvf^ 

242 

nop 

no 

*75 

nm 

171 

• 

tup 

*35 

nmp 

296 

▼T 

nop 

^78 

V 

230 

nip 

226 

nm 

* 4 ^ 

nop 

103 


JIhi Illb Ilb 




Using the results already obtained in Table 38 we have

    3Q  = 3(-61)  + 170 - 6 + 127 = +108
    3R0 = 3(+63)  - 170 - 6 + 127 = +140
    3R1 = 3(-113) + 170 + 6 + 127 = -36
    3R2 = 3(-11)  + 170 - 6 - 127 = +4

    P.M   = +2.25 = +0.07 tons per acre.
    P.M.N = +10.4, -7.2, -3.2 = +0.30, -0.20, -0.10 tons per acre.

The sums of squares are:

             D.F.    Sum of squares    Mean square
    P.M        1          40.5             40.5
    P.M.N      2         283.7            141.9

Replacing the values already given in Table 39 by these, we can complete
the analysis as shown in Table 62.
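The arithmetic of this example can be sketched in Python. The block differences g and the per-level interaction totals are those appearing in the equations above. The divisors 48, 10, 288 and 60 are assumptions chosen because they reproduce the printed estimates and sums of squares for 36 plots; the variable names are hypothetical.

```python
# Interaction totals of the two two-level factors at each level of n,
# and the block-total differences g1, g2, g3, as used in the text.
pm_by_level = [63, -113, -11]
g = [170, -6, 127]

three_q = 3 * sum(pm_by_level) + sum(g)

# Sign pattern applied to (g1, g2, g3) for 3R0, 3R1, 3R2.
signs = [(-1, 1, 1), (1, -1, 1), (1, 1, -1)]
three_r = [
    3 * total + sum(s * gi for s, gi in zip(sign_row, g))
    for total, sign_row in zip(pm_by_level, signs)
]
assert sum(three_r) == three_q            # the check given in Section 13b

pm = three_q / 48                          # estimate in lb. per plot
mean_r = sum(three_r) / 3
dev = [r - mean_r for r in three_r]        # deviations from the mean
pmn = [d / 10 for d in dev]

ss_pm = three_q ** 2 / 288
ss_pmn = sum(d * d for d in dev) / 60

print(three_q, three_r, pm, pmn, round(ss_pm, 1), round(ss_pmn, 1))
```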


Table 62. Analysis of variance of 3 x 2 x 2 experiment.

                D.F.    Sum of squares    Mean square
Blocks            5        24938.9           4987.8
Treatments       11        26302.1           2391.1
Error            19         6363.8            334.9

Total            35        57604.8



Two two-way tables will be required to show the interactions between p and n 
and m and n. Since these interactions are not affected by the confounding the 
tables can be obtained directly from Table 38 in the manner already described. 
If a two-way table for p and m is also required it can best be built up by the 
method of Section ^ using the value of P.M calculated above. These three 
tables are given in Table 63. 


Table 63. Two-factor tables, tons per acre.



no 

m 

na 


(I) 

L 

(0 

4.14 

5. II 

4.68 

4.64 

4 - 4 * 


m 

5.89 

6.31 

5-55 

5.92 

5.61 

6.22 


5-01 

5-71 

S-I2 

' 00 

5.01 

5 SS 

(I) 

4.70 

5.50 

4.82 

5-01 



p 

5 - 3 * 

5 - 9 * 

5 - 4 * 

5 -SS 




Since no one of the interactions between two factors is significant it will
scarcely be necessary to give a three-way table to exhibit the [...]
three factors, but if one is required the calculation may be carried out in
two stages as in the 3 x 3 x 3 example (Section 10c). Thus, neglecting the
interaction N.M.P, [...]

To include the effect of N.M.P [...]
might be added to the lines (1) and mp and subtracted from the
lines p and m, thus [...] 6.38.

The full sets of values are shown in Table 64.





Table 64. Three-factor table, tons per acre.

        (a) Neglecting N.M.P              (b) Including N.M.P
          n0      n1      n2                n0      n1      n2
(1)      3.87    4.94    4.42     (1)      4.02    4.84    4.37
p        4.42    5.28    4.94     p        4.27    5.38    4.99
m        5.54    6.06    5.21     m        5.39    6.16    5.26
mp       6.23    6.54    5.87     mp       6.38    6.44    5.82
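The relation between the two halves of Table 64 can be sketched in Python. The adjustment used here, half of each three-factor value (+0.30, -0.20, -0.10 tons per acre) added to the lines (1) and mp and subtracted from the lines p and m, is an inference from the printed figures, not a rule quoted from the text; the names are hypothetical.

```python
# Table 64 (a): values neglecting the three-factor interaction, columns n0, n1, n2.
table_a = {
    "(1)": [3.87, 4.94, 4.42],
    "p":   [4.42, 5.28, 4.94],
    "m":   [5.54, 6.06, 5.21],
    "mp":  [6.23, 6.54, 5.87],
}

# Three-factor interaction values in tons per acre, one per level of n.
pmn = [0.30, -0.20, -0.10]

# Assumed sign of the adjustment for each line (see lead-in).
sign = {"(1)": +1, "p": -1, "m": -1, "mp": +1}

table_b = {
    line: [round(value + sign[line] * effect / 2, 2)
           for value, effect in zip(values, pmn)]
    for line, values in table_a.items()
}

for line, values in table_b.items():
    print(line, values)
```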


13d. 3 X 2 X 2 X 2 design in blocks of 6 plots. 

With three factors at two levels (but not with more) there is an arrangement 
in blocks of 6 plots similar to that with two factors at two levels, only i- of the 
relative information on the interactions between pairs of factors at two levels 
being sacrificed. 72 plots are required to provide a balanced design. The 
12 blocks of this design are given in Table 65. 


Table 65. 3 x 2 x 2 x 2 design.

Level of a    Ia   Ib   Ic   Id     IIa  IIb  IIc  IId     IIIa IIIb IIIc IIId
a0            b    (1)  d    c      c    d    (1)  b       d    c    b    (1)
a1            c    d    (1)  b      d    c    b    (1)     b    (1)  d    c
a2            d    c    b    (1)    b    (1)  d    c       c    d    (1)  b


In this table only one of the pair of combinations of b, c and d for each level
of a is shown. When (1) occurs bcd must occur also; similarly cd must
occur with b, bd with c and bc with d. Thus the block Ib contains the treat-
ments a0, a0bcd, a1d, a1bc, a2c, a2bd.

The required formula are simple extensions of those applicable to the 
J X 2 X 2 design. Denote the differences between the block totals in replication I 
, andg/, where 


gl = 

[Id 

+ 

Ib] 


[Ic 


[Id] 

gl- 

[Id 

— 

Ib 

+ 

Ic] 



gl = 

la] 

- 

Ib 


Ic 

• 

+ 

Id] 

k J 


— lor replications 11 and 111. 

repUad hZn by' Q be 

rp. , . 

ai ZwCr 

13e. Extension to 3 x 2^n designs in blocks of 3 x 2^(n-1) and 3 x 2^(n-2) plots.

[...] the methods and equations set out for the 3 x 2 x 2
design in blocks of 6 plots are immediately applicable. Take X1 to represent




the treatment combinations which are taken as positive in the interaction between
the n factors at two levels, and X0 the combinations which are taken as negative,
so that with three factors b, c, and d at two levels, X1 represents the four
combinations bcd, b, c, d, and X0 the four combinations bc, bd, cd, (1). As
before three complete replications are necessary, the six blocks being those
shown in Table 66.


Table 66. 3 x 2^n design in blocks of 3 x 2^(n-1) plots.

    Ia       Ib       IIa      IIb      IIIa     IIIb
    a0X0     a0X1     a0X1     a0X0     a0X1     a0X0
    a1X1     a1X0     a1X0     a1X1     a1X1     a1X0
    a2X1     a2X0     a2X1     a2X0     a2X0     a2X1


The interaction between all the factors at two levels and the interaction
between these and the factor at three levels, will be partially confounded.
The only modification required in the formulae already given is a proportionate
increase in the numerical divisors to allow for the increased number of plots.
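The two sets X1 and X0 can be generated for any number of two-level factors. The following Python sketch assumes the usual sign convention, under which a combination is positive in the n-factor interaction when an even number of the n letters is absent; this reproduces the sets quoted for b, c and d. The function name is hypothetical.

```python
from itertools import combinations

def split_by_interaction_sign(letters):
    """Return (X1, X0): combinations positive and negative in the interaction
    of all the two-level factors named in `letters`."""
    n = len(letters)
    x1, x0 = [], []
    for size in range(n + 1):
        for combo in combinations(letters, size):
            name = "".join(combo) or "(1)"
            # positive when the number of absent letters (n - size) is even
            (x1 if (n - size) % 2 == 0 else x0).append(name)
    return x1, x0

x1, x0 = split_by_interaction_sign("bcd")
print(x1)
print(x0)
```

With two factors b and c the same function gives X1 = (1), bc and X0 = b, c, the sets that occur in the 3 x 2 x 2 design.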

The extension of the 3 x 2 x 2 x 2 design follows exactly the same lines
as the extension of the 3 x 2 x 2 design, giving blocks of 3 x 2^(n-2) plots. If, for
example, a fourth factor e at two levels is introduced the interactions B.C.E,
B.D.E and C.D, and their interactions with A, might be chosen for partial
confounding. The design is given by writing b and e for b, and (1) and be for
no b, in the 3 x 2 x 2 x 2 design. Thus block Ia will contain the plots

    a0b, a0e, a0cd, a0bcde, a1c, a1bce, a1bd, a1de, a2d, a2bde, a2bc, a2ce.

It may be noted that there is no 3 x 2 x 2 design in a 6 x 6 quasi-Latin
square which leaves the main effects completely unconfounded. A design
exists which partially confounds the interaction between the two factors at
two levels and the interactions between all three factors, and in addition slightly
confounds the main effect of one of the factors at two levels. In view of the
additional complication in the computations we have omitted this design.


13f. 3 x 3 x 2 design.

Denote the three factors by A (0, 1, 2), B (0, 1, 2), C (0, 1). Since 9 is
not a factor of 6 the interaction A.B cannot be completely unconfounded when
the experiment is arranged in blocks of 6 plots. Using I and J to indicate the
different diagonal sets of the combinations of a and b, as indicated in Table 4[...],
we have the following design of 36 plots (Table 67) which partially confounds
A.B (I) and A.B.C (I).


Table 67. 3x3x2 design in blocks of 6 plots. 



la 

Ib 

Ic 

Ila 

lib 

lie 

Co 

12 

I3 

h 

h \ 

h 

h 

Cl 

I) 

h 

U 

h \ 

h 

h 




The first block, for example, will contain the treatments 

^ 0 ^ 2^01 

A similar design, which confounds A.B (J) and A.B.C (J), is obtained by writing
J instead of I. If 72 plots are available both designs should be used, so that
all four degrees of freedom for A.B, and also all four for A.B.C, are equally
confounded.

The method of analysis is similar to that applicable to the 3 x 2 x 2 design.
To estimate the I component of A.B when there are 36 plots and the I components
are confounded the quantity

    2Q1 = 2[I1] - [Ib] - [Ic] - [IIb] - [IIc]

and two similar quantities 2Q2 and 2Q3 may be calculated. The sum of these
is zero.

The relative information is |, so that, since ^ x the estimate of 

the interactions is given by 

A.B (/)= 02, 03)= tV20., 20„ 203) 

the error variance of each of these quantities being ^ The sum of squares 
for the two degrees of freedom is 

_ ^5(0“)= 3V 5(20)= 

The estimates of the confounded components of A.B.C are obtained by 
calculating the three quantities 

, ^ f ["“’1 - ["^1 

etc.^ where {I\.C] denotes the sum of the /j components in the table of - Cq. 
The sum of the three quantities is 2[C]. The relative information is so that, 
since T ^ the estimate is given by 

A.B.C (I) = J dev i? = ^ dev 2 R 

Note the introduction of an extra 2, since one of the factors is at two levels only 
I he error variance of each of these quantities is ^ x f (r^ 

The sum of squares for the two degrees of freedom is 

} dev=/?= dev= 2 R 

. The formula for the design of 36 plots which confounds the 7 components 
interaction arc obtained from the above formula by writing 7 for I. 

then ^ and J Components are confounded, 

otal mTX the quantities 0 are calculated as above, but each 

Ihe relative information is now i, so that the divisor o in the above formula 

for A B C^(l)lr^Ihr ® O) are similarly obtained. Estimates 

infnmf;- ^ obtained by calculating quantities R as above, but the relative 
.nformat.o« ,s now 8 , so that all the divisors given above must be multipUed bTs 

13g. 3 x 3 x 3 x 2 design in blocks of 6 plots.

inter JcdonVo^aUMitTLr’’ i" which the 

same manner as i^ he ^ I'™'? Pn«ially confounded in the 

two degrees offridom^nf ‘he designs 

levels a?e conmleSlv r mtemction between all three factors at three 

freedom are give? ^ ^ab^bf confounded degrees of 


of 


f 




Table 68. Confounded degrees of freedom in 3 x 3 x 3 x 2 designs.

Design: “W" "X" “7” “Z” 

Partially 

confounded 

Completely 

confounded A.B.C W X Y Z 

From this table it will be seen that if all four designs are used (432 plots) 
the I and J components of all the partially confounded interactions are con- 
founded equally, so that, as in the 3 x 3 x 2 design in 72 plots, the relative 
information on A.B, A.C and A.D is J, and that on A.B.D, A.C.D and B.C.D 
is In addition all components of A.B.C are equally confounded, the relative 
information being 

If the 27 combinations of the three-level factors are divided into the
following 9 sets of three:

    K1     K2     K3     K4     K5     K6     K7     K8     K9
    000    100    200    010    110    210    020    120    220
    111    211    011    121    221    021    101    201    001
    222    022    122    202    002    102    212    012    112

then the first 9 blocks of the "Z" design are those given in Table 69, the
other 9 blocks being obtained by interchanging d0 and d1.
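The nine sets of three can be generated mechanically. The Python sketch below assumes, from the pattern of the printed combinations, that each set is obtained from its first member by adding 1 to every digit (modulo 3), and that the sets are numbered K1 to K9 in the order in which their first members are printed. Both the rule and the numbering are inferences, and the function name is hypothetical.

```python
def k_sets():
    """Return the nine sets K1..K9 of three-level combinations as digit strings."""
    sets = []
    for b in range(3):
        for a in range(3):
            start = (a, b, 0)
            sets.append(
                ["".join(str((digit + shift) % 3) for digit in start)
                 for shift in range(3)]
            )
    return sets

K = k_sets()
for index, members in enumerate(K, start=1):
    print(f"K{index}", members)

# The nine sets together cover all 27 combinations exactly once.
assert len({combo for members in K for combo in members}) == 27
```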


I 

i 


(A.B and A.B.D 

A. C and A.C.D 

B. C and B.C.D 


j 

y 

I 


y 

I 

y 


I 

y 

y 


I 

I 

I 


Table 69. First replication of the 3x3x3x2“Z” design. 


Block 

la 

Ib 

Ic 

Id 

It 

If 

h 

Ih 

li 

(/o 

Kx 

Kf, 

Ki 

Ki 

Ka 

Ki 

K3 

K, 

Ki 

dl 

Kf, 

Ki 

Kx 

Ka 

Ki 

Ki 

K, 

Ki 

Ki 


The "W," "X" and "Y" designs are obtained from the "Z" design
by interchanging a1 and a2, b1 and b2, and c1 and c2 respectively in the expression
for the K's. Thus for "W" we take K1 to represent the combinations 000,
211 and 122, etc.

The estimates of the partially confounded effects are obtained in exactly
the same manner as in the 3 x 3 x 2 design. Thus to estimate A.B (I) in the
"Z" design the quantity

[le] -[If] '[Ig] -[II'l 
He] - [Ilf] - [Ilg] - [Ilh] 

and two similar quantities are calculated. 


2 [/Jab- 

la] - 

[Ic] - 


Ila] - 

[lie] - 


13h. Extension to 3^n x 2 designs in blocks of 3^(n-1) x 2 and 3^(n-2) x 2 plots.

The designs already given can be extended in the same manner as the
3 x 3 x 3 and 3 x 3 x 3 x 3 designs (Section 10f).

It may be noted here that there is no reasonably simple 3 x 3 x 2 x 2 design
in blocks of 6 plots. A design in blocks of 12 plots (and more generally a design
for 3 x 3 x 2^n in blocks of 3 x 2 x 2^(n-1) plots) may be obtained by extending
the 3 x 3 x 2 design in the same manner as the extension of the 3 x 2 x 2
design to 3 x 2^n in blocks of 3 x 2^(n-1). This design confounds A.B and A.B.C.D





only, but there are other designs which sacrifice less information on A.B, at 
the expense of confounding A.B.C and A.B.D, and generally increasing the 
complications of the computations. We shall not consider them here. 

13i. 3 x 3 x 2 design in a 6 x 6 quasi-Latin square.

It is possible to form a square of which the rows are the blocks of Table 67,
and thus confound the I components of the interactions between the two factors
at three levels, and the columns are the similar blocks which confound the
J components of these interactions. Only one such square exists (except for
permutations of rows and columns). This square is shown in Table 70, where
the first figure of each number indicates the combination of the two three-level
factors, and the second figure the level of the factor at two levels.


Table 70. 3 x 3 x 2 design in a 6 x 6 quasi-Latin square.

    20    70    60    41    31    81
    40    30    80    51    11    91
    90    50    10    21    71    61
    71    61    21    30    80    40
    31    81    41    50    10    90
    51    11    91    70    60    20


The estimates of the confounded interactions are computed in exactly the 
same manner as in the 3 x 3 x 2 design in blocks of 6 plots, using row and 
column totals instead of block totals. The relative information on the inter- 
actions between the two three-level factors is 4, and that on the interactions 
of ail three factors is |. 

In this design there are only 8 degrees of freedom for error, but in view 
of the small amount of information available on the three-factor interactions 
these may justifiably be included in the estimate of error, giving 12 degrees 
of freedom in all, except in cases in which these interactions are likely to be 
large. This saves an appreciable amount of computation. 


14. Confounding with one or more factors at four levels or eight levels.
14a. General method.

[...] confounding when one [...] eight levels and the remainder are at two
levels can be [...] factor a at four levels there are associated three degrees
of freedom which may be partitioned into single degrees of freedom as follows:

    A'   = a3 + a2 - a1 - a0
    A''  = a3 - a2 - a1 + a0
    A''' = a3 - a2 + a1 - a0

[...] slightly different sense from those in Section 11. [...] regression
2A' + A''' represents the linear component [...] and 2A''' - A' represents the
cubic component. If A''' is confounded and the cubic component is assumed to
be negligible then (5/2)A' gives an estimate of the linear component of the
regression.

Using this partition, A' and A" may be taken as representing the main 
effects of two different two-level factors, in which case A'" will be their inter- 
action. A single factor at four levels may thus be formally replaced by two 
factors at two levels. In a similar manner, a factor at eight levels may be 
replaced by three factors at two levels. 
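The formal replacement of a four-level factor by two two-level factors can be checked numerically. In the Python sketch below the coefficients of A' and A'' are read from the partition given above, and A''' is taken as their product, in line with the statement that it is their interaction. The combinations 2A' + A''' and 2A''' - A' are then compared with the familiar equally spaced linear and cubic coefficients. The variable names are hypothetical.

```python
# Coefficients applied to the totals (a0, a1, a2, a3).
A1 = [-1, -1, 1, 1]     # A'   = a3 + a2 - a1 - a0
A2 = [1, -1, -1, 1]     # A''  = a3 - a2 - a1 + a0
A3 = [p * q for p, q in zip(A1, A2)]   # A''' as the interaction of A' and A''

def dot(u, v):
    return sum(p * q for p, q in zip(u, v))

# The three single degrees of freedom are mutually orthogonal.
orthogonal = dot(A1, A2) == 0 and dot(A1, A3) == 0 and dot(A2, A3) == 0

linear = [2 * p + r for p, r in zip(A1, A3)]   # 2A' + A'''
cubic = [2 * r - p for p, r in zip(A1, A3)]    # 2A''' - A'

print(A3, orthogonal)
print(linear, cubic)
```

A'' itself has the coefficients of the quadratic component, so the three degrees of freedom span the same comparisons as the linear, quadratic and cubic components of the regression.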

14b. Example: 4 x 4 designs.

As an example we may consider the design of a 4 x 4 experiment (factors
a and b) in blocks of 8 and in blocks of 4 plots.

A 4 x 4 design is the equivalent of a 2^4 design. With blocks of 8 plots
any single degree of freedom for interactions between the four two-level factors
may be confounded. We might, for instance, confound A'.A''.B'.B'', which is
equivalent to A'''.B'''. This would be the best single degree of freedom to
choose if we wished to keep the linear and quadratic components of interaction
as free as possible, without resorting to partial confounding. The partition
of the treatment combinations into the two types of sub-block would then be
given by the + and - signs in the product:

    (a3 - a2 + a1 - a0)(b3 - b2 + b1 - b0).

A better course, however, would be to confound different interactions in 
different blocks. If four replications were available, for example, we might 
confound A\B\ A^B'", A'^B” and A"'.B'" once each. 

With blocks of four plots three degrees of freedom will be confounded in 
each replication. With three replications the nine degrees of freedom repre- 
senting interactions between A and B may be confounded in three sets. One 
such group of sets is : 

    A'.B'          A'.B''         A'.B'''
    A''.B''        A''.B'''       A''.B'
    A'''.B'''      A'''.B'        A'''.B''

The partition of the treatment combinations corresponding to the first set, for
instance, is given by the four combinations of + and - signs, + +, + -, - +, - -,
in the two products A'.B' and A''.B''. The three sets correspond to an
orthogonal set of 4 x 4 Latin squares, with the rows and columns representing
the four levels of the factors a and b respectively.
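The correspondence with a Latin square can be illustrated for the first set. In the Python sketch below the signs of A' and A'' at each level are read from the partition of Section 14a, and the numbering 0 to 3 of the four sign combinations is an arbitrary labelling introduced only for the illustration.

```python
# Signs of A' and A'' (and likewise B' and B'') at levels 0, 1, 2, 3.
SIGN1 = {0: -1, 1: -1, 2: 1, 3: 1}    # A' = a3 + a2 - a1 - a0
SIGN2 = {0: 1, 1: -1, 2: -1, 3: 1}    # A'' = a3 - a2 - a1 + a0

# Arbitrary block labels for the sign pairs (A'.B', A''.B'').
LABEL = {(1, 1): 0, (1, -1): 1, (-1, 1): 2, (-1, -1): 3}

def block_of(a, b):
    """Block containing the combination of level a of A with level b of B."""
    return LABEL[(SIGN1[a] * SIGN1[b], SIGN2[a] * SIGN2[b])]

square = [[block_of(a, b) for b in range(4)] for a in range(4)]
for row in square:
    print(row)

# Latin square property: every block occurs once in each row and each column.
is_latin = all(sorted(row) == [0, 1, 2, 3] for row in square) and all(
    sorted(col) == [0, 1, 2, 3] for col in zip(*square)
)
print(is_latin)
```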

A balanced arrangement of this type is particularly useful when one of the
factors represents four different varieties, or other treatments for which all
possible comparisons are of equal interest, for in such a case the interactions
of A', A'' and A''' with B are all of equal importance.

14c. Combined varietal and manuring trials in Latin squares.

There is not space here to give a complete enumeration of designs for
all the various combinations of factors at 2, 4 and 8 levels, but with the above
example in mind the reader should have no difficulty in constructing the designs
he requires from the designs for factors at two levels given in Sections 5 and [...]





In particular he should notice the possibilities of arranging combined varietal 
and manuring (or cultivation) trials in 8 x 8 Latin squares. 

Thus, for example, if four varieties and three fertilizers n, p, k are to be
tested, the design of Table 32 may be used, identifying the combinations of
b and c with the varieties, and a, d and e with n, p and k respectively. If V',
etc. are defined as A' etc. above, and V' is identified with B, and V'' with C,
so that the combinations (1), b, c, bc of b and c are replaced by v1, v2, v0, v3
respectively, the following degrees of freedom will be confounded:

    Rows:     V'.N.K,  V''.P.K,  V'''.N.P
    Columns:  V'.P.K,  V''.N.P,  V'''.N.K


With 8 varieties and the three standard fertilizers, variants of the design
of Table 33 may be used. If the combinations of a, b and c are identified with
the varieties, and d, e and f with n, p and k respectively, we shall then have the
following degrees of freedom confounded:

Rows: VKN.K, V\N.P, V\P.K, V*.N.P.K, V\P, V\N. 

Columns: VKN.P, y\N.P.K, V\K, V\P.K, V^N.K, V\P. 

V^, r*, . . . K’ being a set of 7 orthogonal varietal degrees of freedom of the 
form T/. 

= Hi - 02 + 1)3 - _ Vf , 

etc., such that V^.V^= etc. A second square can be formed by making 
the cyclical change of varieties : 

8 being left unchanged. The square so formed will confound an entirely 
different set of interactions. A further application of the same cyclical change 
will confound a further set different from the first two, but complete balance 
will only be obtained by using all the seven squares given by repetition of 
the above cyclical change, when each interaction degree of freedom between 
manure and varieties will be confounded twice, 5/7 of the relative information 
being thus retained. 


The reader who is interested in the structure of these designs will do well 
to determine their connection with an orthogonal set of seven 8 x 8 squares 

'P of Experiments (2nd edition), or in Statistical 

Medical and Agricultural Research. He may note further 
HeHvJw' f proposed m Section 8c for the design in 128 plots is 

as to ^ set. and should satisfy fcmself 

in Q ^oTQua?e.r‘ for 9 varieties and 9 treatment combinations 

of Table 51, the first number 

(1) 2 and 3, 4 and 7, 5 and 9, 6 and 8, 

(2) 2→6→3→8→2, 4→5→7→9→4, 

(3) 2→8→3→6→2, 4→9→7→5→4, 

KeenTem'^1 "iW f" ‘•'o fo“r squares will 

menu Ld vTriS I I 1 “" 3 ?°"'"'* “f interaction betoeen treat- 

“ vaneties, i of the relative information being retained. 




15. Dummy treatments. 

It frequently happens in factorial experiments that one or more of the 
factors is of such a nature that certain treatment combinations are identical. 
Thus if one of the factors consists of three different qualities of a fertilizer and 
another consists of three different amounts of the same fertilizer (including no 
fertilizer), there will in fact be no difference between the different qualities at 
zero level of the fertilizer. If the formal factorial design is followed, three 
identical plots having no fertilizer will be included in each replication. There 
will consequently be additional degrees of freedom for error arising from 
comparisons between identical combinations, and correspondingly fewer treat- 
ment degrees of freedom. The partition of the treatment degrees of freedom 
into their separate components will also be different. Confounding, moreover, 
introduces further complication. 

There is not space here to discuss all the modifications that are required 
in the analysis of variance, if this analysis be conducted on strictly rigorous lines, 
but we will give certain arrangements of this type which will illustrate the main 
points. 

Possible types of confounding are derivable from the ordinary factorial 
designs already given, by using dummy treatments where necessary. Other 
types not so derivable may also occasionally be of interest. For an example 
of these latter see (8). 


15a. Application of fertilizer at two different times. 

As a first example let us consider the design of an experiment to determine 
the response of sugar-beet to nitrogen applied at two different times, in con- 
junction with early and late lifting of the crop. 

A 2 x 2 x 2 design, with factors n, time of application, and time of lifting, 
might be adopted. This would give the treatment combinations 

e, e', l, l', en, en', ln, ln' 

where the dash indicates the later application of n, and e and l indicate early 
and late lifting. The combinations e and e', and l and l', are in reality identical. 

It is not difficult to see that the appropriate partition of the degrees of 
freedom, and the estimates of the corresponding effects, are those given in 
Table 71. 


Table 71. Partition of degrees of freedom. 

Effect                         Estimate 

Nitrogen (N)                   ¼(en + en' + ln + ln' - e - e' - l - l') 
Time of application (A)        ½(en - en' + ln - ln') 
Time of lifting (L)            ¼(en + en' - ln - ln' + e + e' - l - l') 
N.L                            ¼(en + en' - ln - ln' - e - e' + l + l') 
A.L                            ½(en - en' - ln + ln') 


These degrees of freedom are all orthogonal, and the sums of squares, 
together with the sums of squares from e - e' and l - l', which are error 
degrees of freedom, total to the sum of squares for the seven degrees of 
freedom obtained from the treatment totals by keeping e and e' and l and l' 
separate. 



If the experiment is arranged in blocks of 4 plots the confounding of the 
formal three-factor interaction will give the two block types 

    e       e' 
    l'      l 
    en'     en 
    ln      ln' 

The expression for A.L above is now not orthogonal with blocks. It may 
be replaced by the formal expression for the A.L interaction (with the numerical 
factor changed), namely 

½(en - en' - ln + ln' + e - e' - l + l') 

which is orthogonal with blocks. The function of the plots without n is to 
act as compensators for any inequalities between blocks. It is clear that with 
the same error variance per plot the variance of the estimate of this interaction 
will be doubled by the confounding. 

There is now one error degree of freedom 

e - e' + l - l' 

the other being absorbed by the confounding. The reader will do well to set 
out the formal expressions derived from the ordinary 2 x 2x2 design for all 
the degrees of freedom. He will find that the above error degree of freedom 
is twice the difference of the formal expressions for A and N.A, while the 
estimate of A in Table 71 is the sum of these expressions. 
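
The reader who prefers to check these statements mechanically may find the following sketch helpful. It writes each comparison as a row of coefficients over the eight formal combinations, using exact fractions, and verifies the orthogonality of the estimates of Table 71, the relation of A and the error comparison to the formal expressions for A and N.A, and the doubling of the variance of A.L under confounding. The ordering of the combinations and the helper names `vec` and `dot` are assumptions of the illustration.

```python
from fractions import Fraction as F

# Order of the eight formal combinations (the dash marks later application).
COMBOS = ["e", "e'", "l", "l'", "en", "en'", "ln", "ln'"]

def vec(scale, signs):
    """Contrast coefficients in the order of COMBOS."""
    return tuple(scale * s for s in signs)

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

# Estimates of Table 71, together with the two error comparisons.
contrasts = {
    "N":    vec(F(1, 4), (-1, -1, -1, -1, 1, 1, 1, 1)),
    "A":    vec(F(1, 2), (0, 0, 0, 0, 1, -1, 1, -1)),
    "L":    vec(F(1, 4), (1, 1, -1, -1, 1, 1, -1, -1)),
    "N.L":  vec(F(1, 4), (-1, -1, 1, 1, 1, 1, -1, -1)),
    "A.L":  vec(F(1, 2), (0, 0, 0, 0, 1, -1, -1, 1)),
    "e-e'": vec(F(1), (1, -1, 0, 0, 0, 0, 0, 0)),
    "l-l'": vec(F(1), (0, 0, 1, -1, 0, 0, 0, 0)),
}
names = list(contrasts)
all_orthogonal = all(
    dot(contrasts[p], contrasts[q]) == 0
    for i, p in enumerate(names) for q in names[i + 1:]
)

# Formal expressions of the ordinary 2 x 2 x 2 design (factor 1/4).
formal_A = vec(F(1, 4), (1, -1, 1, -1, 1, -1, 1, -1))
formal_NA = vec(F(1, 4), (-1, 1, -1, 1, 1, -1, 1, -1))
# Formal A.L with the numerical factor changed to 1/2.
formal_AL = vec(F(1, 2), (1, -1, -1, 1, 1, -1, -1, 1))

# Block comparison when the formal three-factor interaction is confounded:
# +1 for the block (e, l', en', ln), -1 for the block (e', l, en, ln').
blocks = (1, -1, -1, 1, -1, 1, 1, -1)

sum_A_NA = tuple(a + b for a, b in zip(formal_A, formal_NA))
twice_diff = tuple(2 * (a - b) for a, b in zip(formal_A, formal_NA))
# Ratio of variances for equal error variance per plot.
variance_ratio = dot(formal_AL, formal_AL) / dot(contrasts["A.L"], contrasts["A.L"])
```

The sum of squared coefficients of a comparison is proportional to the variance of its estimate, which is why the ratio of these sums for the two forms of A.L gives the loss due to confounding directly.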

15b. Alternative designs. 

It is instructive also to consider alternative designs for the above experiment. 
If the main interest of the experiment is a comparison of the effects of early and 
late application of nitrogen the above design may be considered unsuitable in 
that only one half of the plots contribute information on this point. An alter- 
native set of treatments would be 

e, l, en, en', ln, ln' 

one of each of the duplicates being omitted. 

The estimates of the treatment effects will then be those given in Table 72. 


Table 72. Partition of degrees of freedom. 

Effect        Estimate 

N             ¼(en + en' + ln + ln' - 2e - 2l) 
A             ½(en - en' + ln - ln') 
L             ⅓(en + en' - ln - ln' + e - l) 
N.L           ¼(en + en' - ln - ln' - 2e + 2l) 
A.L           ½(en - en' - ln + ln') 






Another design including the same treatments is that given by the 2 x 2 x 2 
design containing factors n early, n late, and time of lifting. The treatment 
combinations will then be 

e, l, en, ln, en', ln', enn', lnn' 

Here again only half the plots enter into the comparisons on time of 
application, but one quarter of the plots receive a double dressing of nitrogen, 
thus giving an estimate of the curvature of the response curve. The appropriate 
partition of the degrees of freedom is given in Table 73. 


Table 73. Partition of degrees of freedom. 

Effect                                 Estimate 

Response to double dressing (N')       ½(enn' + lnn' - e - l) 
Curvature (N'')                        ½(enn' + lnn' - en - ln - en' - ln' + e + l) 
Time of application (A)                ½(en + ln - en' - ln') 
Time of lifting (L)                    ¼(enn' - lnn' + en - ln + en' - ln' + e - l) 
L.N'                                   ½(enn' - lnn' - e + l) 
L.N''                                  ½(enn' - lnn' - en + ln - en' + ln' + e - l) 
L.A                                    ½(en - ln - en' + ln') 


If the formal three-factor interaction is confounded this is equivalent to 
confounding L.N''. If the formal two-factor interactions between time of lifting 
and n early, and time of lifting and n late, are also confounded in their turn, 
each of the three equally frequently, two-thirds the relative information on 
L.N', L.N'' and L.A will be obtained. The above two-factor interactions are, 
in fact, ½(L.N' + L.A) and ½(L.N' - L.A). 
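
The last statement can be verified by writing the comparisons as coefficients of the eight treatment combinations. The sketch below is an illustration only: it assumes that early lifting and the presence of a dressing are taken as the positive levels, that L.N' and L.A carry the factor ½, and that the formal interactions carry the usual factor ¼. The function names are hypothetical.

```python
from fractions import Fraction as F

# Treatment combinations of the 2 x 2 x 2 design, coded as
# (late lifting, n early present, n late present).
COMBOS = {
    "e": (0, 0, 0), "l": (1, 0, 0),
    "en": (0, 1, 0), "ln": (1, 1, 0),
    "en'": (0, 0, 1), "ln'": (1, 0, 1),
    "enn'": (0, 1, 1), "lnn'": (1, 1, 1),
}

def formal(use_lift, use_early, use_late):
    """Formal effect or interaction of the 2 x 2 x 2 design, factor 1/4."""
    out = {}
    for name, (lift, early, late) in COMBOS.items():
        sign = 1
        if use_lift:
            sign *= 1 if lift == 0 else -1
        if use_early:
            sign *= 1 if early == 1 else -1
        if use_late:
            sign *= 1 if late == 1 else -1
        out[name] = F(sign, 4)
    return out

def expr(scale, plus, minus):
    """Comparison with +scale on `plus` and -scale on `minus`."""
    out = {name: F(0) for name in COMBOS}
    for name in plus:
        out[name] += scale
    for name in minus:
        out[name] -= scale
    return out

LN1 = expr(F(1, 2), ["enn'", "l"], ["lnn'", "e"])   # L.N'
LA = expr(F(1, 2), ["en", "ln'"], ["ln", "en'"])    # L.A

half_sum = {k: (LN1[k] + LA[k]) / 2 for k in COMBOS}
half_diff = {k: (LN1[k] - LA[k]) / 2 for k in COMBOS}

lift_x_early = formal(True, True, False)   # time of lifting x n early
lift_x_late = formal(True, False, True)    # time of lifting x n late
```

With these conventions `half_sum` coincides with the formal interaction of lifting with n early, and `half_diff` with that of lifting with n late.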


15c. 3 x 3 x 3 design including quality differences. 

If we wish to experiment on three forms of nitrogen, each form being at 
three levels, in conjunction with three levels of phosphate, the ordinary 3 x 3 x 3 
design will give three sets of three identical treatment combinations. 

The partition of the treatment degrees of freedom (including dummies) 
will therefore be as follows : 

    N ...... 2        Q ...... 2        Q.N.P .. 4 
    P ...... 2        Q.N .... 2        Error .. 6 
    N.P .... 4        Q.P .... 4 

N, P and N.P are estimated in the ordinary manner from the 3 x 3 table for 
n and p. Q and Q.N will be estimated from the 3 x 2 table for q0, q1, q2 
and n1 and n2 (n0 being omitted). 

It may be reasonable to suppose that the differences due to quality at the 
higher level of n are double those at the lower level. If this is so, 
efficient estimates of the quality differences in units of the differences at the 
lower level of n will be given by ⅕ the differences of 

n1q0 + 2n2q0,   n1q1 + 2n2q1,   n1q2 + 2n2q2 

meaned over all levels of p. Deviations from this supposed type of quality 
effect, i.e. the interaction Q.N, will be given by the differences of 

2n1q0 - n2q0,   2n1q1 - n2q1,   2n1q2 - n2q2 

which are orthogonal to the above differences. (The reader will find it 
instructive to take some numerical example and check that the sums of squares 
for Q and Q.N, calculated from the above expressions, total to the sum of 
squares for the 3 x 2 table less the sum of squares for the n1 - n2 component 
of N.) 

Similarly the interactions Q.P and Q.N.P will be given by the interactions 
of the two 3 X 3 tables containing the values of the above expressions for 
all levels of p. 
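
The numerical check suggested to the reader for Q and Q.N may be sketched as follows. The 3 x 2 table of values is made up for the illustration, and the companion comparison 2n1 - n2, taken here as the one orthogonal to n1 + 2n2, is an assumption of the sketch. Each sum of squares is divided by 5, the sum of the squares of the coefficients 1 and 2.

```python
# Hypothetical 3 x 2 table: rows are the qualities q0, q1, q2,
# columns are the levels n1 and n2 (made-up values, single observations).
table = [[10.0, 14.0], [12.0, 19.0], [11.0, 15.0]]

def ss_about_mean(values, divisor):
    """Sum of squares of deviations from the mean, divided by `divisor`."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / divisor

# Quality effect supposed to be doubled at the higher level of n.
u = [n1 + 2 * n2 for n1, n2 in table]
# Orthogonal companion, measuring departures from that supposition (Q.N).
w = [2 * n1 - n2 for n1, n2 in table]

ss_q = ss_about_mean(u, 5)    # 5 = 1^2 + 2^2
ss_qn = ss_about_mean(w, 5)   # 5 = 2^2 + 1^2

cells = [v for row in table for v in row]
ss_table = ss_about_mean(cells, 1)

n1_total = sum(row[0] for row in table)
n2_total = sum(row[1] for row in table)
ss_n = (n1_total - n2_total) ** 2 / 6   # single degree of freedom for n1 - n2

remainder = ss_table - ss_n
```

For this table `ss_q + ss_qn` and `remainder` agree, as they must, since the two comparisons together with n1 - n2 exhaust the five degrees of freedom of the table.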


If the experiment is arranged in blocks of nine plots, the ordinary type of 
3x3x3 confounding being employed, it will be found that both Q.P and Q.N.P, 
if calculated as above, will be affected by block differences. The simplest 
procedure is to construct the standard 3x3 table for q and p, including the 
dummy treatments. The quantities in this table will be free from block effects, 
and consequently the 4 degrees of freedom for interactions will be compounded 
of Q.P, Q.N.P and certain error components. They will therefore serve to 
test for interaction between q and p. 


We can, however, improve on this procedure by constructing a 3 x 3 table 
of the quantities 

[«iMo]+ [n.Mo] + i 5o(«o) 

etc., or better (if the quality effect is of the type considered above) of the quantities 

[»! po go] + 2[«2 Po go] + J SoM - i Sa(no), 

5 o(no) being ^e suin of the plots in blocks containing neither «ipo?o nor 
«a;>o 9 oi and Aafnoj.bemg the similar sum in blocks containing rtopr^Q.. Both 
these sets of quantities are orthogonal to blocks and to the main effects and the 
other two-factor interactions, and there is little loss of information. 

in thought that the three-factor interaction could be dealt with 

o Ih! analogous expressions are not orthogonal 

nri . t- terms. They wiU. howev« 

0™aSToplin ^ three-fartor interaction, though the tests^ of significance 

Kce^iy^s^btlaSn.' squarroannot 


factoJ bteSon. fn th ^ ® 

aSbk S^n^lvt- 1 ^ "T not considered 

mav be procedure appropriate to the ordinary 3x3x3 design 

degC of freedomlrorir^or”"^ ^ 

consult (3)^d (S\ wbprA ^ ts mterested m the general problem should 
dm “'“''•‘I® '™lved' for some examples of 




16. Arrangements with split plots. 

16a. Structure and analysis of split-plot designs. 

An experiment of any design may have its plots divided into two or more 
parts for subsidiary treatments. This procedure is of practical utility when 
treatments are included which are of such a nature that they necessitate large 
plots, as for example may occur in combined varietal and manurial trials, in 
which it is often inconvenient to use such small plots for the varieties as are 
practicable for the fertilizers. 

The use of split-plots in randomized block experiments, however, results 
in a loss of information on the whole-plot treatments (with a compensating 
gain on the sub-plot treatments and their interactions with the whole-plot 
treatments), compared with the information which would be obtained in an 
ordinary factorial design using the same sub-plots, even without confounding, 
and the use of split-plot designs should therefore not be resorted to without 
good practical reasons unless the effects of the treatments to be associated with 
the whole plots are not of primary importance. On the other hand if the use 
of an ordinary factorial design would necessitate an arrangement in randomized 
blocks, whereas the use of split-plots enables a Latin-square design to be 
adopted for the whole-plot treatments, the latter design does not necessarily 
result in any loss of efficiency even on the whole-plot comparisons, owing to 
the generally higher efficiency of the Latin square. 

The formal analogy between split-plot designs and ordinary confounded 
experiments will be immediately apparent. In split-plot designs main effects 
are confounded, instead of high-order interactions, the whole plots being 
analogous to the blocks of an ordinary confounded experiment. Analytically 
the important difference is that whereas in confounded experiments the small 
amount of information on the confounded interactions accruing from inter- 
block comparisons is ordinarily ignored, in split-plot experiments the information 
from whole-plot comparisons is retained, so that in all split-plot designs there 
are two different errors, one relating to the whole-plot comparisons and the 
other to the sub-plot comparisons. 

The analysis of split-plot experiments is formally simple. The analysis 
of variance is divided into two parts. The first part is calculated from the 
yields of the whole plots, and furnishes errors and tests of significance for the 
whole-plot treatments, exactly the same procedure being followed as in an 
ordinary randomized block or Latin square arrangement. The second part is 
calculated from the yields of the sub-plots, deducting those parts of the sum 
of squares which have already been accounted for in the analysis of the whole 
plots. This is equivalent to analysing the deviations of the sub-plots from 
their respective whole-plot means. 
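
The equivalence just stated may be illustrated on a small made-up example. The yields below are hypothetical (three whole plots each split into two sub-plots), and the whole-plot sum of squares is divided by the number of sub-plots in a whole plot so that both parts are in units of a single sub-plot.

```python
# Hypothetical yields: three whole plots, each split into two sub-plots.
yields = [[10.0, 14.0], [12.0, 20.0], [9.0, 13.0]]

k = len(yields[0])                        # sub-plots per whole plot
all_values = [y for plot in yields for y in plot]
correction = sum(all_values) ** 2 / len(all_values)

# Total sum of squares from the individual sub-plot yields.
total_ss = sum(y * y for y in all_values) - correction

# First part: from the whole-plot totals, divided by k after squaring.
whole_plot_ss = sum(sum(plot) ** 2 for plot in yields) / k - correction

# Second part: obtained by deducting what the whole plots account for.
sub_plot_ss = total_ss - whole_plot_ss

# The same quantity from deviations about the whole-plot means.
deviation_ss = sum((y - sum(plot) / k) ** 2 for plot in yields for y in plot)
```

Here the total of 76 divides into 28 between whole plots and 48 within whole plots, whichever way the second part is computed.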

In order to make the mean squares of the two parts of the analysis comparable 
it is customary to work both parts in units of a single sub-plot. The sums of 
squares of the first part (as calculated from the whole-plot totals) will therefore 
be divided by an additional factor equal to the number of sub-plots in a whole 
plot. In calculating the standard errors applicable to the total yields of whole 
plots the whole-plot error mean square must consequently be multiplied by this 
factor. 

In the special case in which the whole plots are split into two parts only 
the differences between the pairs of sub-plots may be analysed directly in exactly 
the same manner as the totals of the pairs. The sums of squares from these 
differences will then also be divided by an extra 2. One extra degree of freedom 
representing the mean difference, i.e. the main effect of the treatment for which 
the split is made, and corresponding to the correction for the mean in the 
analysis of the totals, will be included in the analysis of the differences. The 
calculation of the total sum of squares of the experiment gives a check on the 
calculation of the totals and differences of the pairs and their sums of squares. 


Many useful extensions of the split-plot type of design are available. In 
general, plots may be split into any number of units, and the resultant sub-plots 
may if desired be subjected to a further split, and so on indefinitely. Correspond- 
ing to each split a different estimate of error will appear in the analysis of variance. 


The whole plots may be arranged in either randomized blocks or Latin 
squares. The treatments of the sub-plots will ordinarily be arranged at random 
within each whole plot. If confounding is resorted to it is not necessary to 
include all the sub-plot treatments in every whole plot. Designs of this type 
are exactly parallel to the more complex types of confounding already discussed, 
with main effects substituted for one or more of the confounded interactions. 


Furthermore in certain cases it is possible to impose Latin-square restrictions 
on sets of sub-plots. Such designs are parallel to the designs already given 
the name of quasi-Latin squares. By replacing interactions by main 
effects such squares are seen to yield a number of designs in which whole rows 
or both rows and columns are subjected to different treatments, most of the 
interactions of the Latin-square treatments with these being determined with 
full precision. Quasi-Latin squares which have both rows and columns subjected 
to such treatments may conveniently be called plaid squares, while if either 
rows or columns only are so treated they may be called half-plaid squares. 

mportantlpplicatiof ‘™ls is a further 


howevS^we'lni '’c given at the end of the section. First, 

blocS ^ ‘l“ig" in randomized 

16b. Example : a varietal and manurial trial on oats. 

The pUn and‘vieldVof\h?"i-T .’’T 8''’™ ‘n Section pa. 

of variance "n Table 75. P'”'^ ■" 74. the analysis 




Table 74. Varietal and manurial trial : plan and yields in ¼ lb. 


oj 


Vt 


Va 


V, { 


t’l 


Vt 


Vt 


{ 


Vt 


Vt 


f ns 156 

na 118 

1 rti 140 

no 105 

r nt 111 

ni 130 

l «5 174 

157 

r no 117 

nt 114 

l tla 161 

ns 141 


r n. 

104 

no 

70 

1 «, 

89 

ns 

1 17 

r 

122 

no 

74 

1 

89 

na 

81 

I — 

103 

no 

64 


132 

ns 

133 


ni 

108 

na 

126 

ns 

149 

no 

70 

ns 

144 

nt 

124 

na 

121 

no 

96 

no 

61 

ns 

ICO 

. nt 

91 

na 

97 


RoW 3 


na 

109 

ns 

99 

no 

63 

nt 

70 

no 

80 

na 

94 

ns 

X26 

ni 

82 

nt 

90 

na 

100 

«3 

116 

no 

62 


Vt 


Vt 


vt 


ns 96 

no 60 

na 89 

nt 102 

na 112 

ns 86 

no 68 

m 64 

na 132 

ns 124 

m 129 

no 89 


} 


Vt 


Vi 


V$ 


na 

118 

no 

53 

ns 

”3 

nt 

74 

ns 

104 

na 

86 

no 

89 

m 

82 

no 

97 

m 

99 

na 

119 

nt 

121 


Vt 


Vt 


Vt 


Area of each sub-plot: 1/80 acre. (28.4 links x 44 link rows.) 

Table 75. Varietal and manurial trial: analysis of variance (sub-plot basis). 

                              D.F.    Sum of squares    Mean square 

Correction for mean                     778336.06 

Whole plots : 
    Blocks                      5        15875.28         3175.06 
    Varieties                   2         1786.36          893.18 
    Error                      10         6013.30          601.33 
    Total                      17        23674.94 

Sub-plots : 
    Nitrogen                    3        20020.50         6673.50 
    N x Varieties               6          321.75           53.63 
    Error                      45         7968.76          177.08 

Total                          71        51985.95 







The sums of squares for varieties, nitrogen, and their interactions are calculated 
from the two-way table (Table 34) in the manner explained in Section 9a. 
The sum of squares for blocks is calculated from the block totals in the ordinary 
manner, dividing by 12 after squaring, and the total sum of squares between 
whole plots is calculated from the whole-plot totals, dividing by 4 after squaring. 
The total sum of squares for the whole experiment is calculated directly from 
the yields of the 72 sub-plots. The whole-plot error is then obtained by 
subtraction of the sums of squares for blocks and varieties from the total sum 
of squares between whole plots, and the sub-plot error is obtained by subtraction 
of this total and the sums of squares for nitrogen and the interactions from 
the total sum of squares for the whole experiment. The formal analogy of this 
analysis with that of Table 12 should be noted. 
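
The two subtractions may be set out as a short calculation. The sketch below takes the degrees of freedom and sums of squares as read from Table 75; the tuple layout and the helper names are assumptions of the illustration.

```python
# (degrees of freedom, sum of squares) as read from Table 75.
blocks = (5, 15875.28)
varieties = (2, 1786.36)
whole_plot_total = (17, 23674.94)
nitrogen = (3, 20020.50)
interaction = (6, 321.75)
grand_total = (71, 51985.95)

def subtract(total, *parts):
    """Error line obtained by subtraction of the listed parts from a total."""
    df = total[0] - sum(p[0] for p in parts)
    ss = total[1] - sum(p[1] for p in parts)
    return df, round(ss, 2)

def mean_square(line):
    return round(line[1] / line[0], 2)

# Whole-plot error: total between whole plots less blocks and varieties.
whole_plot_error = subtract(whole_plot_total, blocks, varieties)

# Sub-plot error: grand total less whole plots, nitrogen and interactions.
sub_plot_error = subtract(grand_total, whole_plot_total, nitrogen, interaction)

# Each treatment mean square is compared with its own error.
ratio_varieties = mean_square(varieties) / mean_square(whole_plot_error)
ratio_nitrogen = mean_square(nitrogen) / mean_square(sub_plot_error)
ratio_interaction = mean_square(interaction) / mean_square(sub_plot_error)
```

The varietal mean square is thus tested against the whole-plot error, and the nitrogen and interaction mean squares against the sub-plot error.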

It is immediately clear that the effect of nitrogen is definitely significant, 
but that the varietal differences do not approach significance. The deceptive 
appearance of the table of the yields of the treatment combinations (Table 76) 
in this respect should be noted. Here, although the differences between the 
varieties are not significant, the varieties fall in the same order, v1, v2, v3, at 
each level of n. This is characteristic of split-plot experiments in which the 
whole-plot error is substantially greater than the sub-plot error, being due to 
the fact that the same whole-plot errors affect all levels of the sub-plot treatments. 

In the present example the interactions mean square is very decidedly 
below expectation, but not quite significantly so. Had it been significantly 
below expectation, this could of course only have been due to chance, unless 
there were some error or defect in the statistical analysis : for this reason if 
significantly sub-normal results occur repeatedly in any type of work the statistical 
procedure should be reviewed, both in its numerical and theoretical aspects. 


16c. Calculation of standard errors. 

Since there are two different errors applicable to whole-plot and sub-plot 
comparisons respectively, the calculation and use of the standard errors applicable 
to the yield totals of Table 34 require a little care. The varietal totals are 
totals of 6 whole-plots (= 24 sub-plots) and their standard error is therefore 
(from the whole-plot error mean square) 

    √(6 x 4 x 601.33) = √(24 x 601.33) = 120.1 

The nitrogen totals are totals of 18 sub-plots, and their standard error is 
therefore (from the sub-plot error mean square) 

    √(18 x 177.08) = 56.5 

The totals in the body of the table are totals of 6 sub-plots, and in any 
comparison involving only nitrogen effects and interactions they have a 
standard error 

    √(6 x 177.08) = 32.6 

This applies to the difference between two values in the same line of the 
table, or any comparison made up of components of this type, and any 
interactions between varieties and nitrogen. 



The conversion factor for the body of the table is 80/(112 x 4 x 6) and those 
for the margins are ¼ and ⅓ of this. The final table of results is shown in 
Table 76. 

Normally it will not be necessary to make comparisons between values in 
the body of the table which include any component of the mean varietal 
differences, and therefore in presenting the results it will usually be sufficient 
to give only the above three standard errors. 
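
The three standard errors and their conversion to cwt. per acre may be recomputed as follows. This is a sketch only, using the error mean squares of Table 75 and the conversion factor quoted above; the variable names are assumptions of the illustration.

```python
import math

WHOLE_PLOT_MS = 601.33   # whole-plot error mean square (sub-plot basis)
SUB_PLOT_MS = 177.08     # sub-plot error mean square

# Standard errors of totals, in units of 1/4 lb.
se_variety_total = math.sqrt(24 * WHOLE_PLOT_MS)   # totals of 24 sub-plots
se_nitrogen_total = math.sqrt(18 * SUB_PLOT_MS)    # totals of 18 sub-plots
se_body_total = math.sqrt(6 * SUB_PLOT_MS)         # totals of 6 sub-plots

# Conversion of a total of 6 sub-plots (1/80 acre each, yields in 1/4 lb.)
# to a mean yield in cwt. per acre.
body_factor = 80 / (112 * 4 * 6)

se_variety_mean = se_variety_total * body_factor / 4    # margin of 24 sub-plots
se_nitrogen_mean = se_nitrogen_total * body_factor / 3  # margin of 18 sub-plots
se_body_mean = se_body_total * body_factor
```

Rounded to three decimal places these give the values 0.894, 0.560 and 0.970.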


Table 76. Mean yields of varietal trial in cwt. per acre. 

             n0       n1       n2       n3       Mean 

v1          12.8     16.0     19.8     21.2      17.4  } 
v2          14.3     17.6     20.5     22.3      18.7  }  ± 0.894 
v3          15.5     19.4     20.9     22.6      19.6  } 

Mean        14.2     17.7     20.4     22.0      18.6 
                      ± 0.560 

S.E. of body of table (interactions and n effects only) : ± 0.970. 


A comparison of this type may be required, however, when combining the 
results of experiments. We might, for instance, have a series of smaller trials 
on the same three varieties conducted at only two levels of nitrogenous manuring, 
0 and 0.2 cwt. N per acre, and in the interests of uniformity we might then 
desire to abstract the mean of n0 and n1 from the results of the experiment under 
consideration. The standard error of these means can be derived as follows. 
Calculate the variance (the square of the standard error) of the mean of each 
pair of values from the standard error given in Table 76 for the body of the 
table. This is 

    ½ (0.970)² = 0.470 

Also calculate the variance of the varietal means from this standard error, and 
subtract this from the actual variance of the varietal means given in the table. 
This gives 

    (0.894)² - ¼ (0.970)² = 0.564 

which is the additional component of error variance due to whole plots. Add 
these two variances together 

    0.470 + 0.564 = 1.034 

and take the square root, 1.017, which is the required standard error. The 
point of this calculation is that the additional component of error due to whole 
plots is not increased by taking a mean over some instead of all the sub-plots 
in a whole plot. 
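
The steps of this calculation may be set out as a short sketch, taking the two standard errors from Table 76. The variable names are assumptions of the illustration.

```python
import math

SE_BODY = 0.970          # S.E. of a value in the body of Table 76
SE_VARIETY_MEAN = 0.894  # S.E. of a varietal mean (over 4 levels of n)

# Sub-plot part of the variance of the mean of two values in one line.
var_pair_mean = SE_BODY ** 2 / 2

# Additional whole-plot component: actual variance of a varietal mean
# less what the sub-plot error alone would give for a mean of four values.
whole_plot_component = SE_VARIETY_MEAN ** 2 - SE_BODY ** 2 / 4

# The whole-plot component is the same for a mean over two or four sub-plots.
se_required = math.sqrt(var_pair_mean + whole_plot_component)
```

The result, about 1.017, lies between the standard error of the body of the table and that which would be obtained by wrongly treating the whole-plot component as averaged over two sub-plots only.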


16d. Efficiency. 

It is immediately apparent that the whole-plot comparisons are much less 
accurate than the sub-plot comparisons involving the same number of sub-plots, 
the ratio of the error variances being 601.33 : 177.08 = 3.40 : 1. If instead of 
assigning varieties to whole plots we had completely randomized all 12 
combinations of varieties and amount of nitrogen within each block there would 
have been only one error. The expected value of this error can be found by the 
method of Section 7b, replacing each treatment mean square by the 
corresponding error mean square (Table 77). 



This gives an error mean square of 254.22, so that the precision of the varietal 
comparisons would have been increased by complete randomization in the ratio 
601.33 : 254.22= 2.37, while the precision of the nitrogen effects and its inter- 
actions with varieties would have been decreased in the ratio 177.08 : 254.22 = 0.70. 


Table 77. Calculation of error with complete randomization. 

                                 D.F.    Sum of squares    Mean square 

Blocks                             5        15875.28 

Remainder : 
    Whole plots                   12         7215.96          601.33 
    Sub-plots                     54         9562.32          177.08 

Total within blocks               66        16778.28          254.22 
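
The pooled error and the two ratios of precision follow from a few lines of arithmetic. The sketch below assumes, as above, that each treatment mean square is replaced by the corresponding error mean square, so that the whole-plot error carries 12 and the sub-plot error 54 degrees of freedom.

```python
WHOLE_PLOT_MS = 601.33
SUB_PLOT_MS = 177.08

# Degrees of freedom within blocks: 2 + 10 for whole plots,
# 3 + 6 + 45 for sub-plots.
whole_df, sub_df = 12, 54

pooled_ss = whole_df * WHOLE_PLOT_MS + sub_df * SUB_PLOT_MS
pooled_ms = pooled_ss / (whole_df + sub_df)

# Changes in precision that complete randomization would have produced.
gain_varieties = WHOLE_PLOT_MS / pooled_ms   # varietal comparisons
change_nitrogen = SUB_PLOT_MS / pooled_ms    # nitrogen and interactions
```

A ratio above unity denotes a gain from complete randomization and a ratio below unity a loss.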


If the differences between varieties and the effects of nitrogen are of equal 
importance, then a completely random arrangement will clearly be the better, 
if not precluded by practical difficulties of sowing, etc. In certain cases, 
however, it may be that one set of main effects is of less importance than the 
other set and the interactions of the two sets. Thus, for example, the choice 
of variety might be dictated by other considerations than those of yield, in 
which case the primary function of the above experiment would be to determine 
the response to nitrogen and its possible variation from variety to variety. In 
this case the split-plot type of design is most appropriate. Similarly in an 
experiment including artificial fertilizers and dung there may be no particular 
point in determining with high precision the response to the dung (which is 
likely in any case to be of uncertain composition, and will certainly be applied 
in practice if available) though the variation in response to artificials in the 
presence and absence of dung may be of vital interest. 

16e. Confounding of interactions in split-plot designs. 

In addition to confounding the main effects of the whole-plot treatments, 
we may confound one or more interactions between the sub-plot factors with 
whole-plot differences, thus reducing the number of sub-plots in each whole- 
plot. The possibilities are very numerous, designs being most simply derived 
by applying different treatments to the blocks (now called whole plots) of 
ordinary designs. Thus in a combined varietal and manurial trial the varietal 
plots may be split into four for all combinations of the manurial factors n, p, k, 
the two sets of combinations (1), np, nk, pk and n, p, k, npk being assigned to 
different whole-plots, so that N.P.K is confounded with whole-plots. With 
6 varieties and 2 complete replications, each replication (12 whole-plots) being 
arranged in a block, the degrees of freedom in the analysis of variance will 
partition as in Table 78. 


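Table 78. Degrees of freedom in split-plot design. 

Whole-plots                      Sub-plots 

Blocks ............  1           N, P, K ...........  3 
Varieties .........  5           N.P, N.K, P.K .....  3 
N.P.K .............  1           V x manures ....... 30 
V.N.P.K ...........  5           Error ............. 36 
Error ............. 11 

Total ............. 23           Total ............. 72 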
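
The partition may be checked by counting. The sketch below assumes 6 varieties, 2 replications, two groups of manurial combinations, and 4 sub-plots in each whole-plot; the dictionary layout is merely a convenience of the illustration.

```python
varieties = 6
replications = 2          # each replication arranged as one block
groups = 2                # the two sets of manurial combinations
sub_per_whole = 4         # sub-plots in each whole-plot

whole_plots = varieties * groups * replications

# Whole-plot part: the error is found by difference.
whole = {
    "Blocks": replications - 1,
    "Varieties": varieties - 1,
    "N.P.K": 1,
    "V.N.P.K": varieties - 1,
}
whole["Error"] = (whole_plots - 1) - sum(whole.values())

# Sub-plot part: comparisons within whole-plots.
sub_total = whole_plots * (sub_per_whole - 1)
sub = {
    "N, P, K": 3,
    "N.P, N.K, P.K": 3,
    "V x manures": (varieties - 1) * 6,
}
sub["Error"] = sub_total - sum(sub.values())

grand_total = (whole_plots - 1) + sub_total   # one less than the number of sub-plots
```

The whole-plot and sub-plot totals together account for the 95 degrees of freedom of the 96 sub-plots.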





We may, however, advantageously confound one of the degrees of freedom 
for V.N.P.K with blocks, thus reducing each block to 6 whole-plots, one for 
each variety, and three for each of the two groups of manurial treatments. 
There will then be 3 degrees of freedom for blocks and 10 for whole-plot error. 
In similar designs with fewer varieties and whole-plots, in which the available 
degrees of freedom for whole-plot error are small, N.P.K and V.N.P.K may 
conveniently be included in the estimate of this error. 

A further and most advantageous alternative is to arrange the whole-plots 
in a 6 X 6 Latin square. To do this, three complete replicates will be required. 
If one of the degrees of freedom for V.N.P.K is confounded with rows it will 
be found that N.P.K must be confounded with columns. Table 79 shows a 
square of this type after randomization, with numbers representing the varieties, 
and a dash the group of treatments (1), np, nk, pk. 


Table 79. 6 x 6 Latin square with split-plots (6 x 2³). 

    1    6'   4'   5'   2    3 
    4    3'   1'   2'   5    6 
    2    4'   5'   6'   3    1 
    3    5'   6'   4'   1    2 
    6    2'   3'   1'   4    5 
    5    1'   2'   3'   6    4 
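
The properties claimed for a square of this type can be checked mechanically. In the sketch below the square is entered as six rows of labels, a dash marking the group (1), np, nk, pk. The entries are those of Table 79, with entries that are illegible in the printed table filled in so as to complete the Latin square, which is an assumption of the illustration.

```python
square = [
    ["1", "6'", "4'", "5'", "2", "3"],
    ["4", "3'", "1'", "2'", "5", "6"],
    ["2", "4'", "5'", "6'", "3", "1"],
    ["3", "5'", "6'", "4'", "1", "2"],
    ["6", "2'", "3'", "1'", "4", "5"],
    ["5", "1'", "2'", "3'", "6", "4"],
]

def variety(cell):
    return int(cell.rstrip("'"))

def dashed(cell):
    return cell.endswith("'")

full = set(range(1, 7))

# Latin-square property of the varieties (whole-plot treatments).
rows_latin = all({variety(c) for c in row} == full for row in square)
cols_latin = all({variety(row[j]) for row in square} == full for j in range(6))

# N.P.K confounded with columns: each column holds one group only.
npk_with_columns = all(len({dashed(row[j]) for row in square}) == 1
                       for j in range(6))

# One degree of freedom of V.N.P.K confounded with rows: the varieties that
# receive the dashed group in a row form one of two fixed sets of three.
dashed_sets = {frozenset(variety(c) for c in row if dashed(c)) for row in square}
```

In this square the dashed group falls on varieties 1, 2, 3 in three of the rows and on varieties 4, 5, 6 in the other three, which is the comparison confounded with rows.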


16f. Half-plaid Latin squares. 

The treatment of whole rows or columns of a Latin square with a set of 
subsidiary treatments is a device which is very frequently useful. It is, however, 
only possible with certain special types of square analogous to the quasi-Latin 
squares already discussed. 

At the outset it should be stressed that rows and columns must be completely 
randomized among themselves, as in quasi-Latin squares with confounded 
interactions. The arrangement of the replicates of the subsidiary treatments 
in blocks is therefore not permissible, but the additional degrees of freedom 
for error are a certain compensation for this disadvantage. 

In order to ascertain if a square of the required type exists it is only necessary 
to see if there is a system of confounding which will give two suitable sets of 
degrees of freedom for confounding with rows and columns. If there is no 
confounding of interactions with the rows (these being subjected to the subsidiary 
treatments), i.e. if the number of treatment combinations of the remaining 
factors is equal to the side of the square, all that is required is a system 
which confounds the whole factorial system (including the subsidiary treatments) 
in randomized blocks of a size equal to the side of the square, 
of the type that has already been enumerated for confounding in randomized 
blocks. 

Thus, for example, in an 8 x 8 square with the rows assigned to one or 
other of two varieties any one degree of freedom for the interactions of varieties 
with the other factors may be confounded with the columns. If the other 
factors form a 2 x 2 x 2 system then the interaction chosen will naturally be 
V.A.B.C. 





If four varieties are included the natural system of confounding with the 
columns will be of the type 

V^.A.C, 


Partial confounding may be resorted to if desired, two sets of this type being 
confounded in a single square. 

The actual construction of any required square can be easily effected. 
All that is necessary is to write down the sets of varietal and treatment 
combinations which confound the chosen interaction degrees of freedom, 
rearranging these sets so that the cross grouping in rows forms sets which each 
contain all combinations of the other treatments but only one variety. 

Table 80 shows an 8 x 8 square for four varieties and a 2 x 2 x 2 treatment 
system. The above set of interactions is confounded with the columns. (In 
order to exhibit the structure the rows and columns have not been randomized.) 
Such a square will not provide a very precise varietal test, but will furnish 
accurate information on possible interactions between the varieties and the 
other treatments. 


Table 80. 8 x 8 half-plaid square for four varieties. 



Similar squares of other sizes are possible. Thus a 6 x 6 square may 
include two or three varieties in addition to the six treatment combinations 
lonnmg a 3 x 2 system (factors a and b). If there are two varieties the arrange- 
*”7^4 D required, partially confounding V.B (| information) 

and V.A.B (f mformation). If there are three varieties one of the arrangements 
ot be^on 13/ will be required, or if two squares are available both arrangements 
may be used, giving | information on V.A. ® 

If there is confounding of interactions as well as subsidiary treatments with 
the rows, the construction of the squares requires a little more care. Thus 

annlSw a 3 x 3 x 3 system of treatments and 3 subsidiary treatments 

T,Ki one of the sets of confounded degrees of freedom shown 

m i able 43 would have to be adopted for the columns, and a set of the type 

f,, V, AM.C, V.A.B.C, 

tor the rows. 






Table 81. A 9 x 9 half-plaid square. 



b 

r 8 

n 

91 

95 

99 

'■4 

P7 

pz 

pfi 

a 

r 6 

n 

98 

93 

94 

rz 

PS 

P 9 


a 


g 6 

Pi 

p% 

PS 

97 

n 

'S 

^9 

b 

?4 


P9 

p^ 

PS 

93 

r 6 

n 

rz 

c 

ri 

n 

96 

97 

92 


PS 

Pi 

pi 

c 

1 PS 

P9 

n 

n 

rfs 

/•i 

94 

98 

93 

a 

PI 

p 2 

n 

'■4 

r 8 

pt 

99 

91 

95 

c 

V) 


pz 

/>6 

PI 

95 

r 8 

»’3 


b 

PS 

Pi 

n 

^9 

ri 

pi 

92 

96 

97 


Table 81 shows a square (randomized) of this type. This design has
recently been proposed for a rotation experiment on sugar-cane, including
3 varieties (p, q and r), 3 quantities and 3 forms of nitrogenous fertilizer
(combinations 1-9) and 3 levels of irrigation (a, b and c). It is intended that
two squares should be laid down at each place, in different phases of the
rotation, and that the experiment should be conducted at two or more places.
The following sets of keys for the combinations 1-9 (Table 82), together with
re-randomization, will serve to generate four squares confounding different sets
of three-factor interactions.


Table 82. Amount and type of fertilizer.

                           I          II         III        IV
Amount of fertilizer:   0  1  2    0  1  2    0  1  2    0  1  2

Type of        t1       1  4  7    1  7  4    1  3  2    1  2  3
fertilizer     t2       2  5  8    3  9  6    4  6  5    7  8  9
               t3       3  6  9    2  8  5    7  9  8    4  5  6


With equal representation and no dummy treatments, half information would
be obtained on the three-factor interaction of varieties, type and amount of
fertilizers and three-quarters information on the other three-factor interactions.
The existence of dummy treatments will modify these fractions somewhat.

The experiment originally suggested was one involving nitrogenous fertilizer
only, but enquiry elicited (1) that the chief interest of the station was in varieties,
(2) that irrigation was likely materially to affect the optimal level of manuring,
and possibly the response to different forms of manuring, and (3) that varieties
had already shown differences in their behaviour on good and poor soils and
therefore might be expected to respond differently to manuring. It is
probable, too, that varieties will behave differently at different levels
of irrigation. A factorial experiment is therefore essential if information of
real value is to be obtained. A half-plaid square is eminently suitable, since
it would be exceedingly difficult to irrigate single plots.

As a further example the reader may construct an 8 x 8 square with a
2 x 2 x 2 x 2 system of treatments and two subsidiary treatments. He may
also construct a set of 4 x 4 squares for four varieties, with [...]
(2 x 2) within the squares, sacrificing one-third of the [...]
between varieties and other treatments; and also a similar [...]
for two varieties, retaining full information on all two-factor interactions.





16g. Plaid squares.

Instead of confining the confounding of main effects to rows only, different 
sets of main effects may be confounded with rows and with columns. Thus 
columns might be assigned to different varieties and rows to different cultivations. 
Upon randomization a typical Scotch plaid pattern will result. 

Table 83 shows an example (before randomization) of this type of arrangement,
comprising three varieties, three cultivations and a 3 x 3 system of
treatments within the square. The following degrees of freedom are confounded:

Rows:    U, A.B.V (Y), A.B.U.V (4 d.f.)
Columns: V, A.B.U (X), A.B.U.V (4 d.f.),

the four-factor interactions being those derived from the interaction of the
other confounded sets. The partition of the degrees of freedom will be that
shown in Table 84. The remainder terms contain three- and four-factor
interactions only.

Table 83. A 9 x 9 plaid square.

           v0          v1          v2
        1  6  8     9  2  4     5  7  3
u0      5  7  3     1  6  8     9  2  4
        9  2  4     5  7  3     1  6  8

        8  1  6     4  9  2     3  5  7
u1      3  5  7     8  1  6     4  9  2
        4  9  2     3  5  7     8  1  6

        6  8  1     2  4  9     7  3  5
u2      7  3  5     6  8  1     2  4  9
        2  4  9     7  3  5     6  8  1


Table 84. Degrees of freedom in the 9 x 9 plaid square.

Rows
    U ..                          2
    Remainder ..                  6
Columns
    V ..                          2
    Remainder ..                  6
Square
    A ..                          2
    B ..                          2
    Two-factor interactions ..   24
    Remainder ..                 36
                                 --
Total                            80
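The structure of the plaid square can be checked mechanically. The following Python sketch, with the square entered as read from Table 83 and with function names that are illustrative assumptions only, verifies that every treatment combination occurs once in each row, once in each column and once in each cultivation-by-variety cell, and recovers the partition of Table 84.

```python
# Table 83 as read from the text: a 9 x 9 plaid square (before randomization).
SQUARE = [
    [1, 6, 8, 9, 2, 4, 5, 7, 3],
    [5, 7, 3, 1, 6, 8, 9, 2, 4],
    [9, 2, 4, 5, 7, 3, 1, 6, 8],
    [8, 1, 6, 4, 9, 2, 3, 5, 7],
    [3, 5, 7, 8, 1, 6, 4, 9, 2],
    [4, 9, 2, 3, 5, 7, 8, 1, 6],
    [6, 8, 1, 2, 4, 9, 7, 3, 5],
    [7, 3, 5, 6, 8, 1, 2, 4, 9],
    [2, 4, 9, 7, 3, 5, 6, 8, 1],
]


def is_latin(square):
    """True if every combination occurs once in each row and each column."""
    n = len(square)
    full = set(range(1, n + 1))
    rows_ok = all(set(row) == full for row in square)
    cols_ok = all({square[i][j] for i in range(n)} == full for j in range(n))
    return rows_ok and cols_ok


def cells_complete(square, size=3):
    """True if each size x size cell (one cultivation, one variety) holds every combination."""
    n = len(square)
    full = set(range(1, n + 1))
    for r0 in range(0, n, size):
        for c0 in range(0, n, size):
            cell = {square[r][c]
                    for r in range(r0, r0 + size)
                    for c in range(c0, c0 + size)}
            if cell != full:
                return False
    return True


def plaid_df(levels=3):
    """Partition of degrees of freedom for four factors (A, B, U, V) at `levels` levels."""
    main = levels - 1                      # d.f. of one main effect
    n = levels ** 2                        # side of the square
    rows = {"U": main, "Remainder": (n - 1) - main}
    cols = {"V": main, "Remainder": (n - 1) - main}
    square = {"A": main, "B": main,
              "Two-factor interactions": 6 * main ** 2}   # six pairs of factors
    within = (n * n - 1) - 2 * (n - 1)     # d.f. left after rows and columns
    square["Remainder"] = within - sum(square.values())
    return rows, cols, square
```

Calling `plaid_df()` returns the three parts of the partition in the order rows, columns, square.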


The reader may construct the 8 x 8 square confounding

Rows:    U, V.A.B, U.V.A.B,
Columns: V, U.A.B.C, U.V.A.B.C.

16h. Use of Latin squares with split plots in varietal trials.

[...] comparison which does not include any other factors all
[...] the varieties [...] be sown
(or planted) in approximately square plots small numbers of varieties (up to




8 or so) can be conveniently arranged in Latin squares, while if the numbers 
are large (25 or over) the quasi-factorial designs described in the next section 
are suitable. In the intermediate range (10 to 24), Latin squares with split 
plots and Graeco-Latin squares (described below) provide a useful set of designs. 

In a split-plot Latin square for 14 varieties, for example, the varieties are 
divided into 7 pairs, these pairs being arranged in a 7 x 7 Latin square, one of 
each pair being assigned at random to one half of each whole-plot. The analysis 
of variance will, as usual, be divided into two parts, the partition of the degrees 
of freedom being that shown in Table 85. 

Table 85. 7 x 7 split-plot Latin square: partition of degrees of freedom.

Whole plots                  Sub-plots
Rows ..          6           Varieties ..     7
Columns ..       6           Error (b) ..    42
Varieties ..     6
Error (a) ..    30
                --                           --
Total ..        48           Total ..        49




There are two types of varietal comparison, one between varieties forming
a pair, and the other between varieties not forming a pair. These have different
errors, that of the former being calculated from the sub-plot error variance (b),
and that of the latter from the mean of the two error variances (a) and (b).
More generally, if each whole-plot is subdivided into k sub-plots, the error
variance of the comparison of any two varieties not occurring in the same set of k
is given by the weighted mean of the variances (a) and (b), the weights being
in the ratio 1 : k-1.
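The partition of Table 85 and the rule for the two types of comparison may be sketched in Python as follows. The function names are illustrative assumptions, and the two error variances are assumed to be expressed on the same per-plot basis.

```python
def split_plot_latin_df(p, k):
    """Degrees of freedom for a p x p Latin square of whole plots, each split into k sub-plots."""
    whole = {"Rows": p - 1, "Columns": p - 1, "Varieties": p - 1}
    whole["Error (a)"] = (p * p - 1) - sum(whole.values())
    sub = {"Varieties": p * (k - 1)}                 # comparisons within sets of k
    sub["Error (b)"] = p * p * (k - 1) - sub["Varieties"]
    return whole, sub


def variance_for_comparison(var_a, var_b, k, same_set):
    """Error variance appropriate to a comparison of two varieties.

    Same set: sub-plot variance (b) only.
    Different sets: weighted mean of (a) and (b), weights 1 : k - 1.
    """
    if same_set:
        return var_b
    return (var_a + (k - 1) * var_b) / k
```

With p = 7 and k = 2 the first function reproduces the entries of Table 85; with k = 2 the second reduces to the simple mean of the two variances for varieties not forming a pair.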


16i. The Graeco-Latin square.

The main objection to the above type of design is that if the errors (a)
and (b) are very unequal the comparisons between varieties in the same set
and between varieties in different sets are by no means equal in accuracy.
An alternative design, which overcomes this disadvantage at the expense of
certain additional complication in the analysis, can be derived from a
Graeco-Latin square.

A Graeco-Latin square consists of a pair of superimposed Latin squares,
one formed of Latin, and the other of Greek letters, [...] such
that every Latin letter occurs once and once only with every Greek letter, and
vice versa. The two squares are thus mutually orthogonal, and a Graeco-Latin
square is consequently derivable from any pair of squares of an orthogonal set.
Graeco-Latin squares are known to exist for all numbers except even numbers
which are not a multiple of 4. Of these latter numbers only 6 has been
exhaustively investigated. For this number there is no Graeco-Latin square.

If we take the Latin and Greek letters of a Graeco-Latin square to
represent varieties (or other treatments) a design similar to a Latin
square with split-plots results. The usual randomization procedure should be
adopted, i.e. randomization of rows and columns and randomization of the
Greek and Latin letter within each pair of plots. The letters should also be
assigned to the varieties at random. Table 86 shows a 7 x 7 square [...]
randomization.
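The orthogonality condition is easily checked by machine. The sketch below, in Python, builds a 7 x 7 Graeco-Latin square by the cyclic method appropriate to a prime side (rows of the Latin square moved one column to the right, rows of the Greek square two columns) and confirms that every pair of letters occurs once. This particular square and the function names are illustrative assumptions; it is not the square of Table 86, and no randomization is applied.

```python
from itertools import product


def cyclic_square(p, step):
    """Latin square (p odd prime) whose rows are each moved `step` columns to the right."""
    return [[(j - step * i) % p for j in range(p)] for i in range(p)]


def are_orthogonal(sq1, sq2):
    """True if every ordered pair of symbols occurs exactly once on superimposition."""
    n = len(sq1)
    pairs = {(sq1[i][j], sq2[i][j]) for i, j in product(range(n), repeat=2)}
    return len(pairs) == n * n


LATIN = "ABCDEFG"
GREEK = "αβγδεζη"

latin = cyclic_square(7, 1)
greek = cyclic_square(7, 2)

# Superimpose the two squares: each whole plot carries one Latin and one Greek letter.
graeco_latin = [[LATIN[a] + GREEK[b] for a, b in zip(row_l, row_g)]
                for row_l, row_g in zip(latin, greek)]
```

Each cell of `graeco_latin` then names the pair of varieties occupying the two halves of one whole plot.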





Table 86. 7x7 Graeco-Latin square. 



The analysis can be effected by forming two tables, one of the sums and
one of the differences of the pairs of plots. These should be set out as in
Table 87.

Table 87. Analysis of a Graeco-Latin square. 



Table 88. 7 x 7 Graeco-Latin square: partition of degrees of freedom.

Table of sums                     Table of differences
Rows of square ..        6        Total (Latin-Greek) ..    1
Columns of square ..     6        Latin letters ..          6
Latin letters ..         6        Greek letters ..          6
Greek letters ..         6        Error (b) ..             36
Error (a) ..            24
                        --                                 --
Total                   48        Total                    49

[...] Table 88. Sums [...] represented by the Latin letters and those
represented by [...] are derived from the [...] differences. The
“interactions” of both tables give the estimates of error (a) and (b) between
“whole plots” and “sub-plots” respectively, corresponding to the errors (a) and
(b) of Table 85. Thus estimates of the two types of error are separately obtained.

If the mean yields of the different varieties are taken as estimates of the
varietal differences the error variance of the difference of two varieties in the
same letter group (i.e. both Latin or both Greek) is, as before, derived from
the mean of the two variances (a) and (b), while the error variance of the difference
of two varieties in different letter groups is derived from a weighted mean of
the two variances, the weights being in the ratio p-1 : p+1. The mean yields
may be immediately obtained from the sum of the two sets of column totals,
and the difference of the two sets of row totals, of Table 87.

It is worth noting that if the two error variances (a) and (b) are widely
different more accurate estimates of the varietal differences may be obtained
by taking a weighted mean of the estimates derived from the sum and difference
tables of Table 87.

16j. The hyper-Graeco-Latin square.

Similar designs with the whole plots split into three or more parts may be
constructed by the use of three or more squares from an orthogonal set. Such
designs may be called hyper-Graeco-Latin squares.

The analysis of variance follows lines similar to that of a Graeco-Latin
square, but the sums of squares cannot be derived from two-way tables. The
simplest procedure is to set out the varietal totals for each group of letters (Latin,
Greek, etc.) as in Table 89, and also the corresponding totals of the whole plots
containing the varieties a, b, etc. (denoted by [w_a], [w_b], etc.). The difference of
the second line from k times the first line is then taken. The second line (of the
Latin letter table) provides estimates of the differences of the Latin letters derived
from differences of whole plots, while the third line provides estimates derived
from sub-plot differences. The sums of squares of the deviations, divided by
pk and by pk(k-1) respectively, give the two sums of squares corresponding
to the two sets of p-1 degrees of freedom for the Latin letters in the whole-plot
and sub-plot parts of the analysis respectively. The sums of squares for the
Greek, etc., letters are derived similarly. The k-1 degrees of freedom for the
contrasts of the k groups of letters are derived from the contrasts of the total
of the first line of Table 89 and the corresponding totals for the Greek, etc., letters.

Table 89. Analysis of a hyper-Graeco-Latin square.

Latin letters
Variety totals:       [a]             [b]             [c]            ...
Whole plot totals:    [w_a]           [w_b]           [w_c]          ...
                      k[a] - [w_a]    k[b] - [w_b]    k[c] - [w_c]   ...

The error variance of the difference of the mean yields of two varieties in
the same group is derived from a weighted mean of the variances of whole and
sub-plots, the weights being in the ratio 1 : k-1, and that of varieties not
in the same group is derived from a second weighted mean, the weights being
in the ratio p-1 : p(k-1)+1.
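The procedure of Table 89 may be sketched in Python as below. The function names and the figures used in the usage line are illustrative assumptions; the divisors pk and pk(k-1) and the two ratios of weights are those stated in the text, the whole-plot variance being taken first in each ratio.

```python
def letter_group_sums_of_squares(variety_totals, whole_plot_totals, p, k):
    """Whole-plot and sub-plot sums of squares (p - 1 d.f. each) for one group of letters."""
    def ss_deviations(values):
        mean = sum(values) / len(values)
        return sum((v - mean) ** 2 for v in values)

    # Third line of Table 89: k[a] - [w_a], k[b] - [w_b], ...
    third_line = [k * a - w for a, w in zip(variety_totals, whole_plot_totals)]
    ss_whole = ss_deviations(whole_plot_totals) / (p * k)
    ss_sub = ss_deviations(third_line) / (p * k * (k - 1))
    return ss_whole, ss_sub


def comparison_variance(var_whole, var_sub, p, k, same_group):
    """Weighted mean of the whole-plot and sub-plot error variances.

    Same group of letters:      weights 1 : k - 1
    Different groups of letters: weights p - 1 : p(k - 1) + 1
    """
    if same_group:
        w_whole, w_sub = 1, k - 1
    else:
        w_whole, w_sub = p - 1, p * (k - 1) + 1
    return (w_whole * var_whole + w_sub * var_sub) / (w_whole + w_sub)


# Hypothetical totals for three letters, p = 3, k = 2.
ss_whole, ss_sub = letter_group_sums_of_squares([10, 12, 14], [20, 25, 27], p=3, k=2)
```

With k = 2 the second function gives the weights 1 : 1 and p-1 : p+1 of the ordinary Graeco-Latin square.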





17. Varietal trials: quasi-factorial designs.

Plant breeders frequently wish to compare a large number of new strains;
numbers such as 100 to 1000 are by no means uncommon. With such a large
number of varieties arrangements in randomized blocks including all the varieties 
will usually be ineffective in eliminating fertility differences, while Latin squares 
are clearly impossible. The classical way of arranging such trials is by the use 
of “ controls,” i.e. plots growing a standard variety. These may be arranged 
either systematically or at random. Recently, however, new methods of 
arranging such trials have been devised, which make possible the use of blocks 
containing only a few plots, or, what is even more useful in many cases, the 
use of Latin squares. Most of these designs may be classified as “ quasi- 
factorial,”* since their structure can be derived from confounded factorial 
designs. Such designs are always more efficient than designs involving controls, 
and will also be more efficient than designs in ordinary randomized blocks when 
there are any considerable inequalities of fertility. 

It would take us too far afield to describe all these designs in detail. We 
shall therefore merely give an outline of the more useful types, without any 
attempt to describe the methods of computation. The reader who wishes to 
utilize the designs should refer to the original papers, (11), (12) and (13), where
he will find a full description, together with numerical examples of the 
computations. 

17a. The lattice.

This is the simplest of the quasi-factorial designs in randomized blocks. 
If we have, say, 90 varieties, numbered 1—90, the rows and columns of the 
two-way table (Table 90) : 


Table 90. Sets for lattice design.

 1   2   3   4   5   6   7   8   9  10
11  12  13  14  15  16  17  18  19  20
21  22  23  24  25  26  27  28  29  30
31  32  33  34  35  36  37  38  39  40
41  42  43  44  45  46  47  48  49  50
51  52  53  54  55  56  57  58  59  60
61  62  63  64  65  66  67  68  69  70
71  72  73  74  75  76  77  78  79  80
81  82  83  84  85  86  87  88  89  90


divide the varieties into two groups of sets containing 10 and 9 varieties each
respectively. In a lattice design the varieties in each set are arranged in the
field in randomized blocks, each group of sets being replicated equally. Thus
[...] each group of sets will be replicated 3 times,
there being 27 blocks of 10 plots each, of which three will contain varieties
[...] 21, 31, [...]
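The formation of the sets from the two-way table may be sketched in Python as follows. The table is assumed to be that of Table 90 (9 rows of 10 varieties), and three replications of each group are taken as in the text; the names are illustrative, and the randomization of sets to blocks and of varieties within blocks is omitted.

```python
ROWS, COLS = 9, 10   # Table 90: 9 rows of 10 varieties, numbered 1-90

table = [[r * COLS + c + 1 for c in range(COLS)] for r in range(ROWS)]

# First group: the rows of the table (9 sets of 10 varieties).
row_sets = [list(row) for row in table]
# Second group: the columns of the table (10 sets of 9 varieties).
col_sets = [[table[r][c] for r in range(ROWS)] for c in range(COLS)]


def lattice_blocks(replications_per_group=3):
    """Contents of every block, each set of each group occurring once per replication."""
    blocks = []
    for _ in range(replications_per_group):
        blocks.extend(row_sets)
        blocks.extend(col_sets)
    return blocks


blocks = lattice_blocks()
```

Each variety then occurs six times in all, three times in a block of 10 plots and three times in a block of 9 plots.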

*The name is [...]




The design is parallel to a factorial design, each variety being represent- 
able by a combination of two factors, one at 9 levels corresponding to rows, 
and the other at 10 levels corresponding to columns. In the replications of the 
first grouping the main effects of the first factor are confounded with blocks, 
in the replications of the second grouping the main effects of the second factor 
are confounded. The main effects of one or both factors will enter into the 
comparison of any pair of varieties, and therefore there is some loss of information 
on all such comparisons, comparisons between varieties which have a set in 
common being slightly more accurate than comparisons which have no set in 
common. This loss of information must be taken into account when assessing 
the efficiency of the design. The efficiency factor* for a p x q lattice is

$$\frac{pq - 1}{pq + p + q - 3}.$$

In the most useful case, when p = q, i.e. when the sets form the rows and
columns of a square, it is

$$\frac{p + 1}{p + 3}.$$

It may be noted that in any case q should not differ widely from p.


If p and q are small the efficiency factor becomes somewhat small. For
25 varieties, for example, it is 3/4. This means that if there were no reduction
in error variance per plot by reduction of block size from 25 to 5 plots, a lattice
design would only give three-quarters of the information that would be given by an ordinary
arrangement in randomized blocks of 25 plots. Of course it rarely happens
that there is no reduction in error variance, though the reduction is sometimes
small. Moreover there is no reason why the information accruing from the
block comparisons should not be taken into account, provided that the experiment
has sufficient replications to give an adequate estimate of error for the inter-block
as well as the intra-block comparisons. This procedure will recover most
of the lost information and makes the design much more attractive for a moderate
number of varieties.†
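The efficiency factor of a p x q lattice, (pq - 1)/(pq + p + q - 3), is readily tabulated. A minimal Python sketch (the function name is an assumption) using exact fractions:

```python
from fractions import Fraction


def lattice_efficiency(p, q=None):
    """Efficiency factor (pq - 1)/(pq + p + q - 3) of a p x q lattice; q defaults to p."""
    q = p if q is None else q
    return Fraction(p * q - 1, p * q + p + q - 3)


# 25 varieties in a 5 x 5 lattice, and the 9 x 10 lattice for 90 varieties.
e_25 = lattice_efficiency(5)
e_90 = lattice_efficiency(9, 10)
```

For p = q the value agrees with (p + 1)/(p + 3), and for 25 varieties it is the three-quarters quoted above.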

In order to utilize the information from inter-block comparisons, and to
make these as accurate as possible, all the blocks forming a complete replication
should themselves be arranged in a compact block on the ground. Pairs of
these replications should contain one replication in each grouping, assignment
of the grouping being at random within the pair. The sets should be assigned
at random to the blocks of each replication.‡ Moreover the numbers of Table
90 (or the position within the table) should be assigned at random to the varieties.


*Defined as the ratio of the variance of a varietal comparison in a design in
ordinary randomized blocks to the average variance in a lattice design occupying
the same number of plots and having the same error variance per plot.

†This procedure is not discussed in the papers referred to above but [...]
matter shortly. In the simplest cases the additional computation required appears to be [...]

‡This method of arrangement is somewhat different from that of the example of (11), in which the use of
block comparisons was not envisaged.





17b. Triple and balanced lattices.

If the number of varieties is a perfect square, and a square lattice is
constructed as above, it is always possible to superimpose a Latin square on 
this square. The letters of this Latin square may be used to denote a third 
group of sets, which may be arranged in randomized blocks in the same manner 
as the other two groups. We thus arrive at what may be called a triple lattice. 
It will be noted that all three groups of sets bear exactly the same orthogonal 
relationship to one another, every set of each group containing one and only 
one variety from every set of the other two groups.

The advantage of introducing a third grouping is that the efficiency factor
is increased, being $\frac{2p + 2}{2p + 5}$ instead of $\frac{p + 1}{p + 3}$.

If the number p is such that a full set of orthogonal Latin squares exists,
further groupings corresponding to these squares may be made. When all the
p - 1 squares are used (giving p + 1 groupings) complete balance is attained,
comparisons between every pair of varieties being of equal precision. The
efficiency factor of a balanced lattice is $\frac{p}{p + 1}$. This corresponds to the fact
that in each replication p - 1 degrees of freedom out of the total of p² - 1 are
confounded, so that the loss of information (blocks being completely ineffective) is

$$\frac{p - 1}{p^2 - 1} = \frac{1}{p + 1}.$$

This is a property of balanced arrangements, which has already been referred to.
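As a worked instance of this property (the value p = 5, i.e. 25 varieties in six groupings, is chosen purely for illustration):

$$\text{loss of information} = \frac{p - 1}{p^2 - 1} = \frac{4}{24} = \frac{1}{6}, \qquad \text{efficiency factor} = 1 - \frac{1}{6} = \frac{5}{6} = \frac{p}{p + 1}.$$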

Full sets of orthogonal squares are known to exist for all prime numbers
and for p = 4, 8 and 9. No such set exists for p = 6. For prime numbers the
method of construction is very simple, each line of the first square being derived
from the previous line by moving the letters one column to the right, each line
of the second square by moving the letters two columns to the right, and so on.
Sets of 8 x 8 and 9 x 9 squares are given in The Design of Experiments (2nd
edition). The 10 groups for 81 varieties may also be derived by the successive
transformation given in Section 14c of the square of Table 51. The first and
second numbers of the treatment combinations and the rows and columns of
each square give the 10 different groupings. The transformation given in
Section 14c for the 8 x 8 square of Table 33 generates the groupings for 64
varieties in a similar manner, except that in the fourth square only the grouping
given by the columns is required.
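The construction for prime numbers, and the balance which results, may be checked by a short Python sketch. The function names are assumptions, and the varieties are simply numbered 1 to p² in systematic order; the check confirms that every pair of varieties occurs together in exactly one set.

```python
from itertools import combinations


def balanced_lattice_groupings(p):
    """The p + 1 groupings of p*p varieties into p sets of p (p must be prime)."""
    def variety(r, c):
        return r * p + c + 1

    groupings = [
        [[variety(r, c) for c in range(p)] for r in range(p)],   # rows of the table
        [[variety(r, c) for r in range(p)] for c in range(p)],   # columns of the table
    ]
    # One grouping per orthogonal square: in square `step` each line is the
    # previous line moved `step` columns to the right, so that a given letter
    # occupies column (letter + step * r) mod p in row r.
    for step in range(1, p):
        groupings.append(
            [[variety(r, (letter + step * r) % p) for r in range(p)]
             for letter in range(p)]
        )
    return groupings


def times_together(groupings):
    """Number of sets in which each pair of varieties occurs together."""
    counts = {}
    for grouping in groupings:
        for block in grouping:
            for pair in combinations(sorted(block), 2):
                counts[pair] = counts.get(pair, 0) + 1
    return counts


groupings = balanced_lattice_groupings(5)
counts = times_together(groupings)
```

Taking only the first two or three groupings gives the simple and triple lattices respectively.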

[...] designs only a single replication of each grouping is
[...] information
[...] of precision desired, and will usually
exceed these minimal requirements except in the case of balanced lattices.

17c. Lattice squares.

Instead of arranging the sets of a balanced lattice in randomized blocks
the groups of sets may be taken in pairs, and for each pair a square may be




constructed having its rows formed of the sets of one group and its columns 
of the sets of the other group. If p is odd, ½(p + 1) squares will be required
for balance, but if p is even each group must be included twice to give p + 1
squares. If the rows and columns of each of these squares be rearranged amongst 
themselves in random order, and the resultant squares set out on the ground, 
we shall have an arrangement which is in essence a set of Latin squares with the 
quasi-factors confounded with rows and columns. 

There is, of course, no absolute necessity for designs of this type to be 
balanced, but the attainment of balance, at any rate when p is odd, does not 
demand an excessive number of replications, and simplifies the computations 
and the interpretation of the results. 

Table 91 shows a balanced set of three lattice squares for 25 varieties (before 
randomization of rows and columns). 


Table 91. Balanced set of lattice squares for 25 varieties.

     Square I               Square II              Square III
 1  2  3  4  5          1 13 25  7 19          1 15 24  8 17
 6  7  8  9 10         20  2 14 21  8         18  2 11 25  9
11 12 13 14 15          9 16  3 15 22         10 19  3 12 21
16 17 18 19 20         23 10 17  4 11         22  6 20  4 13
21 22 23 24 25         12 24  6 18  5         14 23  7 16  5


The method of construction of similar sets for other prime numbers should
be apparent from a study of this table. Sets of squares for 64 and 81 varieties
are provided by the transformation given in Section 14c of the squares of
Tables 33 and 51, together with the square formed by arranging the varietal
numbers in systematic order, as in the first square of Table 91.

These lattice squares are particularly attractive, since they enable the 
advantages of Latin square design to be utilized, whereas the comparisons 
within the sets of an ordinary lattice by means of Latin squares instead of 
randomized blocks would require more replications than are usually available. 
The efficiency factor is, however, somewhat low, being

$$\frac{p - 1}{p + 1},$$

as is easily verified from the property referred to above. With 25 varieties it
has the value of 2/3. The average increase in precision with 5 x 5 Latin squares
in the Rothamsted experiments has been found to be 2.5 : 1, so that the average
net gain in precision on similar land by the use of lattice squares instead of
ordinary randomized blocks for 25 varieties may be expected to be 1.67 : 1 or 67
per cent. This average gain will be somewhat increased by utilizing inter-row
and column comparisons in those experiments in which the land is found to be
very uniform.
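The arithmetic of the net gain can be set out in a few lines of Python. The function name is an assumption; the ratio 2.5 : 1 is the Rothamsted figure quoted in the text.

```python
from fractions import Fraction


def lattice_square_efficiency(p):
    """Efficiency factor (p - 1)/(p + 1) of a balanced set of p x p lattice squares."""
    return Fraction(p - 1, p + 1)


latin_square_gain = Fraction(5, 2)                    # 2.5 : 1 for 5 x 5 Latin squares
net_gain = lattice_square_efficiency(5) * latin_square_gain
percentage_gain = round(float(net_gain - 1) * 100)    # gain over randomized blocks
```

The product 2/3 x 5/2 = 5/3 is the ratio 1.67 : 1 given above.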


17d. Three-dimensional lattices.

Instead of arranging the varietal numbers in a two-way table, as in Table 21,
they may be arranged in a three-way table, i.e. spatially in the form of a cube





or cuboid. A three-dimensional lattice, defining three groups of sets, may then
be constructed by taking lines parallel to the edges of this cube or cuboid.
Thus if there are p x q x r varieties there will be pq sets of r varieties, pr sets
of q varieties, and qr sets of p varieties. With p = q = r there will be three
groups of p² sets of p varieties. Thus an arrangement for p x q x r varieties
in blocks of p, q and r plots, or for p³ varieties in blocks of p plots, is provided.
The efficiency factor in the latter case is

$$\frac{2(p^2 + p + 1)}{2p^2 + 5p + 11}.$$

Using a three-dimensional arrangement of p³ varieties in the form of a
cube, we may also obtain three groups of p sets of p² varieties by taking layers
of this cube parallel to each of the faces in turn. The varieties of each set
may be compared by means of a set of lattice squares, the use of two of the
three groups being all that is really necessary. We thus arrive at an arrangement
for p³ varieties in p x p lattice squares. The efficiency factors are

$$\frac{p - 1}{p + 1} \cdot \frac{p^2 + p + 1}{p^2 + p + 3}, \qquad \frac{p - 1}{p + 1} \cdot \frac{p^2 + p + 1}{p^2 + p + 2\tfrac{1}{2}}$$

respectively, according as two or three groupings in sets of p² are taken, the
total number of replications required (p odd) being (p + 1) and 1½(p + 1)
respectively.
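These efficiency factors may be tabulated by the following Python sketch, in which the function names are assumptions and the formulae are those given above.

```python
from fractions import Fraction


def cubic_lattice_efficiency(p):
    """p**3 varieties in blocks of p plots: 2(p^2 + p + 1)/(2p^2 + 5p + 11)."""
    return Fraction(2 * (p * p + p + 1), 2 * p * p + 5 * p + 11)


def cube_in_lattice_squares_efficiency(p, groupings):
    """p**3 varieties in p x p lattice squares, using 2 or 3 groupings in sets of p**2."""
    extra = {2: Fraction(3), 3: Fraction(5, 2)}[groupings]
    return Fraction(p - 1, p + 1) * Fraction(p * p + p + 1) / (p * p + p + extra)


def replications_required(p, groupings):
    """(p + 1) replications for two groupings, 1.5(p + 1) for three (p odd)."""
    return Fraction(groupings * (p + 1), 2)
```

For 27 varieties (p = 3), for example, the three factors are 13/22, 13/30 and 13/29.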


17e. Non-factorial designs: balanced incomplete blocks*

In all the designs so far considered the number of treatment combinations
is some multiple of the number of plots in a block or in a row or column of a
Latin square, and moreover each replication of the design falls wholly in one
set of blocks or rows or columns. There is a further useful family of designs
in randomized blocks which does not in general fulfil these conditions. This
is the family conforming to the condition that every pair of treatment combinations
shall occur together in the same number of blocks. These designs are balanced,
all treatment comparisons being of equal accuracy. Balanced lattices are
[...] family, and other members are derivable from certain of the
[...] discussed. There are, however, many other members
[...] derivable. The series of chief interest to the
agronomist is a set of designs for p² + p + 1 varieties in blocks of p + 1 plots,
[...]. The structure of this set of designs is dependent on
[...] (5) and (12), and we shall not
discuss them here.


*Previously called symmetrical incomplete randomized blocks.




17f. The introduction of additional treatments in quasi-factorial designs.

The designs described in this section require a large number of blocks, 
and the possibilities of using these blocks as plots for additional treatments 
should not be lost sight of. If, for instance, there are six replicates of a simple 
lattice design, there will be sets of three blocks containing identical varieties, 
and these might be used as plots to compare three additional treatments and 
to ascertain whether the varieties interacted with these treatments. It will be 
noted that interactions between the additional treatments and the sets of varieties 
will inflate the inter-block error. This source of disturbance can be allowed 
for if necessary, but frequently it will not be sufficiently large to be of any 
moment. 





NOTES 


Note 1. Number of figures required in the computations and results.

It is a common fault in numerical work to retain too many figures both in the results and the
intermediate calculations. On the other hand certain calculations require considerably greater
accuracy than others, e.g. in the correction for the mean in the analysis of variance a large number
of figures must be retained. There is not space here to give any detailed discussion of the matter,
but the following hints may be of assistance.

(i) Significant figures. 

The number of significant figures is the number of figures counting from the first figure not 
zero and excluding terminal zeros. Thus 237, 0.00237, 23700 all contain three significant figures. 

(ii) Observed yields, etc.

Only three significant figures need be retained if the standard error of a single observation 
is not less than 3—5 per cent, of the mean (as in the yields of field plots). It pays to round off 
if the field results are given to greater accuracy. Fractions are best decimalized, as working in 
units of a quarter or a half of the ordinary units of measurement introduces dangerous possibilities 
of error. When a computing machine is used working means are best avoided, especially if they 
are such as to introduce negative numbers. 

(iii) Analysis of variance.

Sufficient figures should be retained in the sums of squares to give four significant figures in
the error sum of squares. In cases of doubt the retention of an extra figure or two does not seriously 
increase the work. 

(iv) Presentation of results.

Three significant figures are normally sufficient in agricultural field experiments. In general 
the number of figures required depends on the accuracy of the final results. 

(v) Standard errors.

A good 10 inch slide rule (three significant figures) will give all necessary accuracy, and is
very convenient, since square roots may be read directly.


Note 2. Numerical divisors in the analysis of variance, etc.

The square corresponding to any single degree of freedom is obtained by squaring
a quantity Q, the sum of certain multiples (positive, negative and zero) of the plot
yields, and dividing by a divisor d equal to the sum of the squares of the multipliers.
In the special but common case in which the multipliers are all +1, -1 or 0 the
divisor is equal to the number of plot yields going to make up Q.

Technically Q is said to be a linear function of the plot yields y_1, y_2, ..., i.e.

$$Q = l_1 y_1 + l_2 y_2 + \dots,$$

where l_1, l_2, ... are numerical quantities (the above multipliers), so that

$$d = l_1^2 + l_2^2 + \dots$$

by the above rule. [...]




The estimates of the corresponding effects are obtained by dividing the Q by some divisor λd
which depends on the conventions adopted. In the case of main effects and interactions of factors
at two levels λ is equal to a half. With factors at more than two levels λ is equal to unity unless
one or more of the interacting factors is at two levels only (see Section 13f for an example).

The error variance of Q is equal to d times the error variance of a single plot, and consequently
the error variance of the estimate is 1/(λ²d) times the error variance of a single plot.
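The rule of this note may be sketched in Python as follows. The function name, the argument `lam` (standing for the conventional multiplier of the divisor) and the four plot yields are illustrative assumptions, not figures from the text.

```python
def single_degree_of_freedom(multipliers, yields, lam=1.0):
    """Q, its divisor d, the sum of squares Q^2/d and the estimate Q/(lam * d)."""
    q = sum(l * y for l, y in zip(multipliers, yields))
    d = sum(l * l for l in multipliers)
    return {
        "Q": q,
        "d": d,
        "sum_of_squares": q * q / d,
        "estimate": q / (lam * d),
        # Multiplier of the single-plot error variance giving the variance of the estimate.
        "variance_factor": 1 / (lam ** 2 * d),
    }


# Hypothetical yields of the combinations (1), a, b, ab of a 2 x 2 experiment,
# with the multipliers of the main effect of A and lam = 1/2 (factors at two levels).
main_effect_a = single_degree_of_freedom([-1, 1, -1, 1], [10, 14, 11, 19], lam=0.5)
```

Here the estimate is the difference between the mean of the two plots receiving a and the mean of the two plots not receiving it.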


Note 3. Orthogonal functions. 

If the effects corresponding to two degrees of freedom are estimated from two quantities
Q and Q' such that

$$Q = l_1 y_1 + l_2 y_2 + \dots$$

$$Q' = l'_1 y_1 + l'_2 y_2 + \dots$$

as in Note 2, the two degrees of freedom are orthogonal if

$$l_1 l'_1 + l_2 l'_2 + \dots = 0,$$

i.e. if the sum of the products of the corresponding multipliers of the plot yields is zero. With
three degrees of freedom there are three such sums of products, which must all be zero, and so on.

Similarly two sets of degrees of freedom are orthogonal if the corresponding pairs of Q's and Q''s
are orthogonal, provided that no plot yield enters into more than one such pair.
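The condition can be verified directly. A minimal Python sketch, in which the function name and the 2 x 2 example are assumptions chosen for illustration:

```python
from itertools import combinations


def orthogonal(multipliers_1, multipliers_2):
    """True if the sum of products of corresponding multipliers is zero."""
    return sum(a * b for a, b in zip(multipliers_1, multipliers_2)) == 0


# Multipliers of the plot yields (1), a, b, ab in a 2 x 2 experiment.
contrasts = {
    "A":   [-1, 1, -1, 1],
    "B":   [-1, -1, 1, 1],
    "A.B": [1, -1, -1, 1],
}

# With three degrees of freedom there are three sums of products to examine.
all_orthogonal = all(orthogonal(contrasts[x], contrasts[y])
                     for x, y in combinations(contrasts, 2))
```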


Note 4. Hints on the use of calculating machines. 

(1) Arrange the computations so as to avoid having to write down intermediate steps: the
transfer of numbers from the machine to paper, and back again to the machine, consumes a large
amount of time, and introduces possibilities of error.

(2) Always compute sufficiently carefully to avoid mistakes. Checking should be regarded 
as an assurance that no errors exist, not as a method of correcting errors. 


(3) In long computations, such as extensive sums of squares, record the value attained at
suitable intervals, so as to facilitate the location of possible errors, but do not clear the machine.

(4) In calculating sums of squares or products accumulate the sum of the numbers whenever
possible, even if this sum is already known, either by means of a 1 on the right of the keyboard,
or by means of the register provided on some machines for this purpose.

(5) Partial sums of the multipliers (such as block totals) may [...]
sum of the multipliers at the appropriate intervals, clearing this sum (but not the sum of squares)
if convenient.

(6) In a sum of squares in which the sum is also being accumulated an occasional negative
value (say -123) may be treated by the process:

      1229999999
             123
    ------------
    151289999877

the top line of figures being written on the keyboard. If there are a [...]
negative numbers it is best to square all the positive [...] (not [...]
the sum of squares), and then square all the negative numbers. Sums of products can be dealt
with similarly.



(7) In covariance work with two variables the two sums of squares and twice the sum of
products can be obtained simultaneously by the process:

              1230000456
              1230000456
    --------------------
    01512901121760207936

(A 10 x 10 x 20 machine is required for three-figure numbers). If the sums of squares (together
with the sums) are also calculated separately the sum of products will also be checked (but beware
of negative numbers and errors of copying from the machine).

(8) In covariance work with more than two variables one sum of squares and one sum of 
products (or two sums of products) can be obtained simultaneously by writing two variables at 
opposite ends of the keyboard. 

(9) In covariance work with more than two variables the most effective method of checking
in many types of analysis is to construct an identical table of the sums (s) of the corresponding
values of each variable. The various sums of squares of the s table provide a complete check,
by reason of the identity

$$s^2 = (a + b + c)^2 = a^2 + b^2 + c^2 + 2ab + 2ac + 2bc.$$

More detailed checks are provided by the identities

$$as = a^2 + ab + ac,$$

etc.

(10) If several divisions by the same divisor have to be performed it is best to multiply by 
the reciprocal of the divisor. 
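The devices of hints (6) and (7) rest on simple arithmetic, which may be checked with exact integers. In the following Python sketch the function names and the default of seven places between the two keyboard entries are assumptions matching the three-figure examples above.

```python
def squares_and_product(x, y, shift=7):
    """Hint (7): square x and y set at opposite ends of the keyboard.

    Returns (x squared, twice the product x*y, y squared) as read from the
    three sections of the product register. Valid while both y*y and 2*x*y
    are below 10**shift, as they are for three-figure numbers.
    """
    packed = x * 10 ** shift + y
    square = packed * packed
    y_squared = square % 10 ** shift
    twice_product = (square // 10 ** shift) % 10 ** shift
    x_squared = square // 10 ** (2 * shift)
    return x_squared, twice_product, y_squared


def negative_entry(value, shift=7):
    """Hint (6): product obtained when -value is entered as value followed by nines.

    The keyboard carries value * 10**shift - 1 and is multiplied by value, which
    adds value**2 to the left-hand section and subtracts value from the right.
    """
    keyboard = value * 10 ** shift - 1
    return keyboard * value


parts = squares_and_product(123, 456)
product_for_minus_123 = negative_entry(123)
```

The same packing, with different variables at the two ends of the keyboard, gives the sums of products of hint (8).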




REFERENCES 

BOOKS 

(1) R. A. Fisher. 1925. Statistical Methods for Research Workers. Edinburgh: Oliver and 

Boyd. 6th Edition. 1936. 

(2) L. H. C. Tippett. 1931. The Methods of Statistics. London: Williams and Norgate. 

(3) R. A. Fisher. 1936. The Design of Experiments. Edinburgh: Oliver and Boyd.
2nd Edition. 1937.

(4) D. Mainland. 1937. The Treatment of Clinical and Laboratory Data, An Introduction
to Statistical Ideas and Methods for Medical and Dental Workers. Edinburgh: Oliver
and Boyd. (In the press.)


TABLES 

(5) R. A. Fisher and F. Yates. 1937. Statistical Tables for Biological, Medical and Agricultural 

Research. Edinburgh : Oliver and Boyd. (In the press.) 

PAPERS 

I. On subjects discussed in the text. 

(6) R. A. Fisher. 1926. The Arrangement of Field Experiments. Journal of the Ministry 
of Agriculture, Vol. XXXIII, pp. 503-513. 

An account, in non-mathematical terms, of the principles governing experimental 
design. 

(7) R. A. Fisher and J. Wishart. 1930. The Arrangement of Field Experiments and the 
Statistical Reduction of the Results. Imperial Bureau of Soil Science. Technical 
Communication No. 10. 

A simple explanation of the numerical procedure of the analysis of randomised 
block and Latin square experiments. 

(8) F. Yates. 1933. The Principles of Orthogonality and Confounding in Replicated Experiments. 
Journal of Agricultural Science, Vol. XXIII, Part I, pp. 108-145. 

An account of the principles underlying the structure of replicated experiments. 

(9) F. Yates. 1935. Complex Experiments. Supplement to the Journal of the Royal Statistical 
Society, Vol. II, No. 2, pp. 181-247. 

An outline of the methods of factorial design and an investigation of the gain in 
efficiency resulting from confounding. 

(10) M. M. Barnard. 1936. An Enumeration of the Confounded Arrangements in the 2 x 2 x 2 ... 
Factorial Designs. Supplement to the Journal of the Royal Statistical Society, Vol. III, 
No. 2, pp. 195-202. 

(11) F. Yates. 1936. A New Method of Arranging Variety Trials Involving a Large Number 
of Varieties. Journal of Agricultural Science, Vol. XXVI, Part III, pp. 424-455. 

See Section 17a, b and d. 





F. Yates. 1936. Incomplete Randomized Blocks. Annals of Eugenics, Vol. VII, Part II, 
pp. 121-140. 

See Section 17e. 

F. Yates. 1937. A Further Note on the Arrangement of Variety Trials: Quasi-Latin 
Squares. Annals of Eugenics, Vol. VII, Part IV, pp. 319-331. 

See Section 17c. 

II. On some useful special processes. 

F. Yates. 1933. The Analysis of Replicated Experiments when the Field Results are 
Incomplete. Empire Journal of Experimental Agriculture, Vol. I, No. 2, pp. 129-142. 
The procedure of analysis when one or more plot yields are missing is described. 

F. Yates. 1933. The Formation of Latin Squares for use in Field Experiments. Empire 
Journal of Experimental Agriculture, Vol. I, No. 3, pp. 235-244. 

F. Yates. 1936. Incomplete Latin Squares. Journal of Agricultural Science, Vol. XXVI, 
Part II, pp. 301-315. 

The analysis of incomplete Latin squares is described. The following cases are 
considered: a missing row, column or treatment, a missing row and column, or 
either and a treatment. 

III. Sources of experimental material. 

F. R. Immer. 1932. Size and Shape of Plot in Relation to Field Experiments with Sugar 
Beets. Journal of Agricultural Research, Vol. 44, No. 8, pp. 649-668. 

Rothamsted Experimental Station, Annual Reports, 1925-1936. 

Many actual examples of factorial design are given in these reports, and the whole 
development of factorial design can be followed. Useful methods of presenting 
the results of complicated experiments are exemplified, and some interesting 
long-period rotation experiments are described. 







REPRINTED BY LITHOGRAPHY IN GREAT BRITAIN 
BY JARROLD AND SONS LIMITED, NORWICH 









